Asymptotic behavior of random determinants in 
the Laguerre, Gram and Jacobi ensembles 

Alain Rouault 

LMV Batiment Fermat, Universite Versailles-Saint-Quentin, F-78035 Versailles, France 
E-mail address: Alain.RouaultOmath.uvsq.fr 

Abstract. We consider properties of determinants of some random symmetric ma- 
trices issued from multivariate statistics: Wishart/Laguerre ensemble (sample co- 
variance matrices), Uniform Gram ensemble (sample correlation matrices) and Ja- 
cobi ensemble (MANOVA). If n is the size of the sample, r < n the number of 
variates and X n>r such a matrix, a generalization of the Bartlett-type theorems 
gives a decomposition of det X n ^ r into a product of r independent gamma or beta 
random variables. For n fixed, we study the evolution as r grows, and then take 
the limit of large r and n with r/n — t < 1. We derive limit theorems for the 
sequence of processes with independent increments {n _1 log det X n , t G [0, T]} n 
for T < 1 : convergence in probability, invariance principle, large deviations. Since 
the logarithm of the determinant is a linear statistic of the empirical spectral dis- 
tribution, we connect the results for marginals (fixed t) with those obtained by the 
spectral method. Actually, all the results hold true for log gases or f3 models, if 
we define the determinant as the product of charges. The classical matrix models 
(real, complex, and quaternionic) correspond to the particular values f3 — 1, 2, 4 of 
the Dyson parameter. 



1. Introduction 

Random determinants of symmetric matrices are of constant use in random 
geometry to compute volumes of parallelotopes (see Nielsen (1999), Mathai (1999)) 
and in multivariate statistics to build tests. Twenty years after the book of Girko 
(1988), recent developments in Random Matrix Theory add a new interest to the 
study of their asymptotical behavior and invite to a new insight. 

Let B = [bi, . . . , b r ] be the n x r matrix with r column vectors bi, . . . , b r of 1™. 
If B' denotes its transpose, the determinant of the r x r Gram matrix B'B satisfies 
the well known Hadamard inequality : 

det B'B < ||&i|| 2 ---||M 2 (1-1) 

with equality if and only if b\, . . . , b r are orthogonal (Hadamard, 1893). It means 
that the volume (or r-content) of the parallelotope built from b\ , . . . , b r is maximal 



2000 Mathematics Subject Classification. Primary 15 A 52, 15 A 15; Secondary 60F 10, 60F 
17, 62 H 10. 

Key words and phrases. Random matrices, Wishart ensemble, Laguerre ensemble, Jacobi en- 
semble, Gram ensemble, Hadamard ratio, determinant, invariance principle, large deviations. 



2 



Alain Rouault 



when the vectors are orthogonal. The quantity 

dctB'B 

h{B) = IWI 2 -IIM 2 

is usually called the Hadamard ratio. If we replace sequentially bi by its projec- 
tion bi on the orthogonal of the subspace spanned by &i, . . . , b»_i (Gram-Schmidt 
orthogonalization), we have 

r 

dctB'B = H\\b l \\ 2 . 

1=1 

Motivated by basis reduction problems, Schnorr (1986) defined the orthogonality 
defect as the quantity 1/ yJh(B) (see also Akhavi (2002) and references therein). 
Abbott and Mulders (2001) and Dixon (1984) are concerned with the tightness of 
the bound h(B) < 1 when B is random and n~r. For these authors, the random 
vectors bi are sampled independently and uniformly on the unit sphere 

§l = {(x 1 ,...,x n )eR n :x 2 1+ --- + xl = 1}. 

It is known that then the variables ||&i|| 2 are independent and Beta distributed 
with varying parameters. When the entries of the matrix B are independent and 
7V"(0, 1), the variables ||6i|| 2 are independent and Gamma distributed with varying 
parameters (Bartlctt, 1933). 

Writing B n>r instead of B to stress on dimensions and X n>r = B' nr B n ^ r , we are 
interested in this paper in the asymptotic behavior of det X n>r when n and r both 
tends to infinity in the regime r/n — > c £ [0, 1]. Since the construction of the bi 
is recursive, it is possible (for fixed n) to consider the whole sequence of variables 
{det X nir , r — 1, . . . , n} at the same time. 

It corresponds to the decomposition of the determinant of a symmetric positive 
matrix A as 

, , A det AM 
11 det Ab-i] 

where is the j x j upper-left corner of A with the convention detAi ! = 1. 
When using this approach, we will refer to it as the decomposition method. 

The decomposition method is also valid when entries of the matrix are complex, 
considering the Hermitian conjugate B* and then B*B, and also when the entries 
are real quaternions, considering the dual B^ and then B^B. 

In these three cases, a Bartlett-type theorem gives the determinant as a product 
of independent variables, with Gamma or Beta distributions. Passing to logarithms, 
it is then possible to consider a triangular array of variables and a process with 
independent increments log det X n ^ nt j , t e [0,T]}„ for T < 1 indexed by the 
"time" t — r/n. Thanks to the additive structure of the log det, we obtained limit 
theorems : convergence in probability, invariance principle and large deviations. 

The same is true for random matrices following the Jacobi (or MANOVA) dis- 
tribution. Actually, the whole construction is possible in the so-called /3-models, 
which are an extension of the above ones, which correspond to the three-fold way 
(3 = 1,2,4 of Dyson. For other values of (3 they are not defined as matrix models 
but log gases models, in which the eigenvalues are replaced by charges and deter- 
minants by products of charges. It has be shown recently that they correspond 
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also to models of tri-diagonal random matrices (see Dumitriu and Edclman (2003), 
Killip and Nenciu (2004), Edclman and Sutton (2007)). 

Of course, for r fixed, there is also another underlying structure of product: the 
determinant as the product of eigenvalues. We may use the asymptotical behavior 
of empirical spectral distributions, i.e. convergence to the Marcenko-Pastur distri- 
bution in the Wishart/Laguerre case and to the generalized McKay distribution in 
the Jacobi case. However, this structure is not "dynamic": if you change r, the 
whole set of eigenvalues is changing. When using this approach, we will refer to it 
as the spectral method. 

In Section 2 we set the framework. We begin with the matrix models (Wishart- 
Laguerre, Uniform Gram and Jacobi), and proceed with the /?- models and processes 
of determinants. 

The main results of this paper are in Section 3: laws of large numbers and 
fluctuations, large deviations and variational problems. The comparison of results 
obtained by the two methods (decomposition and spectral) deserves interest and 
is the topic of Section 4. Some extensions to other models are given in Subsection 
4.4. 

Sections 5, 6 and 7 are devoted to the proofs. In Appendix 1 we gather some 
details on the Binct formula on the Gamma function which are of constant use in 
this paper, and Appendix 2 gives identification of the McKay distribution. 



2. Notation and known facts 

In this long section, we present our different models whose common feature is 
to introduce processes of random determinants with independent multiplicative 
factors. The distribution of these factors are recorded in Proposition 2.1 for real 
matrix models, and settled in formulae (2.2), (2.3) and (2.4) for the (other) (3- 
models. 

Throughout, \A\ stands for det A, and /„ for the n x n identity matrix. If X, Y 
are real random variables and [i a distribution on R, we write 

X ( ^Y (rcsp. X = fi) 
if X and Y have the same distribution (resp. if the distribution of X is /i). 



2.1. Real matrix models and Bartlett-type theorems. In the basic model, we consider 
independent random vectors bi,i > 1 with the same distribution v n in R n . The 
most important example is the Gaussian one with v n — A/"(0, /„). If B = [bi, . . . , b r ], 
all the entries of B are independent A/"(0, 1) and the distribution of W — B'B is 
denoted by W r (n, M) and called the Wishart ensemble. For r < n, its density on 
the space S r of symmetric positive matrix is 

1 | W \{n-r-l)/2 ( _ 1 1 w \ 

2™/2r r (n/2) 1 1 V 2 J 

where T r is the multivariate Gamma function 

r r (a) = ir^-^r^r^ _ I) . . . r(« - r -^) , 
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It is the matrix variate extension of the Gamma distribution. Recall that for 
a, c > 0, the Gammafa, c) distribution has density 

~ x e~ cx (x>0). 



T(a) 

For r > n, the matrix is singular. 

Motivated by Hadamard inequality (1.1), we may choose v. n to be the uniform 
distribution on the unit sphere Sg. The corresponding ensemble for B is called 
Uniform Spherical Ensemble by Donoho and Tsaig (2005). The matrix ensemble 
for B'B is called the Gram ensemble by De Cock, Fannes, and Spincemaille (1999), 
since B'B is the Gram matrix built from the bi's. To stress on the distribution, 
we call it Uniform Gram ensemble. The diagonal entries are one and for r < n, 
the joint density of the non-diagonal entries (r^, 1 < i < j < r) of the matrix 
G = B'B is 

MlCI— >" (-1 <,-„<!> (2,) 

(see Gupta and Nagar (2000) Theorem 3.3.24 p.107, Mathai (1993) Example 1.25 
and Mathai (1997) p.58). 

Let us now introduce Jacobi ensembles. For ni,n 2 > 1 and r < n := ri\ + n 2 , 
we can decompose every (ni + n 2 ) x r matrix M in two blocks 

(My 

\M 2 



M = 



with M\ of type n\ x r and M 2 of type n 2 x r. If the entries of M are independent 
W(0, 1), then TUi := M[M X and IU 2 := M' 2 Mi are independent Wishart matrices of 
distribution IU r (ni,IR) and lU r (n 2 ,R), respectively. It is well known that W\ + W 2 
is W r (ni + n 2 ,IR) distributed and a.s. invertible. Let us denote by (W\ + W^) 1 / 2 
the symmetric positive square root of (Wi + IU 2 ). The r x r matrix 

x ■= (Wx + w 2 )- 1/2 w 1 (w 1 + w 2 y 1/2 

has a distribution denoted by J r (m , n 2 , M) and called the Jacobi ensemble. 

If T is upper triangular with positive diagonal entries and TUi + W 2 = T'T 
(Cholesky decomposition) then 

Z = (T'^WiT' 1 

is also J r (ni,n 2 ,M) distributed, (see Olkin and Rubin (1964), Muirhead (1982) 
p.108). 

Another occurrence of the Jacobi ensemble is interesting (see Doumerc (2005), 
Collins (2005)). If M is as above, its singular value decomposition is 



M = UDV , D = 



with D of type nxr, with A diagonal with nonnegative entries, with U € 0(n) and 
V € 0(r) (the orthogonal groups). Although U and V are not uniquely determined, 
one can choose them according to the Haar distribution on their respective group 
and such that U, V, A are independent. Then M'M = V'A 2 V and 

[W x + W 2 ) 1/2 = [M'M) 1 ' 2 = V'AV . 
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Let Y r = U^ ni ' r ^ be the ni x r upper-left corner of U. Since Mi = Y r AV we have 
M[M Y = V'AY r 'Y r AV = {MM*) 1/2 {V'Y; Y rV)(MM*) 1/2 

and then X = (Y r V)'(Y r V) ( = } Y r 'Y r . In other words, 

y := ([/'[™ 1 ' r ])'[/[™ 1 '^ 

is also J r (n 1; n 2 , R) distributed. 

If r < min(n 1 ,n 2 ), the distribution J r (n l7 n 2 ,IR) has a density on <S r which is 

r A ^ |z| ^^ |/r - z|: ^ io <^ ' (2 - 2) 

"12 ' 2 / 

where 

r r (a)r r (&) 
/Ma ' &j ~ r r (a + 6) ' 
(see for example Muirhcad (1982) Theorem 3.3.1). It is the matrix variate extension 
of the beta distribution. Recall that for a > 0, b > 0, the Beta(a, b) distribution 
has density 

r(a + b) a _i, , fn n\ 

fW(bj x {1 - x) (x>0) - (2 - 3) 

Until now, we had r fixed. Our purpose is now to consider all values of r 
simultaneously to give a "sample path" study of determinants. 
For an n x n matrix B = [b\, . . . , b n ], we have for r < n 

(B'B) [r] = (^B [n ' r] y B [n ' r] , 
and for every j < n, the quantity 

. \(B>B)W\ 
p3 ' n \(B>B)ti-i]\ 

is a measurable function of (&i , . . . , bj ) and 

\(B'B)W\ = f[p j>n . (2.5) 

The same occurs with 6, := 6j/||6j|| instead of bi (i = 1, . . . , n) and B := [&i, . . . , 6„] 
instead of £>. Let us note that = 1 and 

p- ^ lWU]l 7-2 n (26) 

so that 

|(B'B)W| = JJ ?J .„. (2.7) 

The Wishart case and the Uniform Gram case corresponds to (2.5) and (2.7) 
respectively, for r — 1, . . . , n. 

In the Jacobi case, r e {1, . . . , n{\. If M = [b\, . . . , b ni ] , and if T, W\, Z are 
defined as above with n\ instead of r, then 
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For every j, the quantity 



z \Z^\ 



Pj,n u n 2 : |2b-l]| 

is a measurable function of (61 , ... , bj ) and 

\Z [r] \ = I[ptn 1 ,n 2 - 
3=1 

It can be noticed that 

z „ \w l r 1] + wt 1] \ 

Pi 



'™ \w[ j] + w^\ \wt 1] \ 

Besides, the construction with the symmetric square root is different. If 

x \XW\ 

Pj,ni,n 2 ■ 

we have 

± (wY ] + w^)~ V2 wt ] (w[ r] + w£ ] )~ 1/2 ■ 

(Take m = n 2 = 2, W x = I 2 , W 2 = ( 1 , ] and r = 1 then 



(w[ 1] + wf ] ) ~ 1/2 w[ r] (W[ r] + W\ r] ) ^ = 2/(4 - s 2 ) , 

and X^ = 1/2). Moreover we cannot say that P* nij „ 2 is measurable with respect 
to 61, . . . , bj . 

Let us consider the construction from contraction of Haar matrices. Since 

, [ni,r] 

we see that the quantity 



^[/["liy^ylm]^ ni ' r _ ^y[ni,r]y jj[n u r] 

\yw\ 



pl 



2J ' \y^-^\ 

depends only on the j first columns of the matrix U, and 



\y [r] \ = l[p- 



y 

ni ,ri2 ,j 

3=1 



It is possible to introduce a probability space on which all Uniform Gram and 
Wishart matrices are defined for all values of n simultaneously. It is enough to 
consider the infinite product space generated by a double infinite sequence of inde- 
pendent Af(0, 1) variables {bij}°° =1 , and for every n to perform the above construc- 
tions with bi = (bii, . . . , 6 n »)'. To embed the Jacobi matrices in this framework, we 
have to restrict ourselves to the X-type and Z-type ones; however, only the Z one 
gives a natural meaning to the dynamic study. 

The starting point of our study of random determinants is the following propo- 
sition which gathers known results about the factors entering in the above decom- 
positions. 
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Proposition 2.1. 1) (Bartlett) The random variables pj. n , j = 1, ...,n are 

independent and 



p j<n = Gamma(^- ^ — , ^ 



where = stands for equality in distribution. 
2) The random variables Pj. n , j — 2, . . . ,n are independent and 

~ (d) r t (n-j + l j-1 
Pj,n = Beta 



3) For J = X (resp. y , Z), the random variables pj ni „ 2 , j = 1, • • • ,n\ are 
independent and 



(d) D , /ni - j + 1 n 2 



/ ni - j + 1 n2N 

I 2 ' 2 ; " 



^>i,n 2 = Beta 

The first claim is known as the celebrated Bartlett decomposition (stated with \ 2 
distributions) (Bartlett, 1933). It is quoted in many books and articles in particular 
Anderson (2003) pp.170-172, Muirhead (1982) Theorem 3.2.14 p.99, Kshirsagar 
(1972), Gupta and Nagar (2000) Theorem 3.3.4 p.91 and ex. 3.8 p.127. The second 
claim may be found in Anderson (2003) Theorem 9.3.3. In the third claim, we first 
note that it is enough to get the proof for Z since the three random matrices have 
the same distribution. It is a consequence of a result quoted in Anderson (2003), 
due to Kshirsagar, is proved in Muirhead (1982) Theorem 3.3.1 p. 110 under the 
assumption r < ni,n 2 and in Rao (1973) p. 541 under the only assumption r < 
m. Actually (see Muirhead (1982) ex. 3.24 and Anderson (2003) Theorem 8.4.1), 
some proofs use probabilistic arguments (as Rao (1973) and Anderson (2003)), 
Jacobian arguments (as in Gupta and Nagar (2000) Theorem 5.3.24 p. 181), or 
Mcllin transform arguments (as in Mathai (1999) Theorem 2). 



2.2. Distribution of eigenvalues and (3-models. In the study of sttionary processes, 
random matrices of the Wishart type with complex entries play an important role 
(Goodman (1963)). Less often, quaternionic entries are considered (see Hanlon 
et al. (1992)). We do not give details on the complex and quaternionic cases but 
jump to a general framework. Popularized by physicists, the modern point of view 
consists in introducing a parameter [3 taking value 1 when real, 2 when complex, 
and 4 when quaternionic, this parameter playing the role of an inverse temperature. 
The above constructions can be extended when replacing the transpose (R case) by 
the adjoint (C case) or the dual (H case). Many of the above results are then true 
when replacing in displays the factor 1/2 by the factor [3/2. 

Actually there are two ways to reach the law of determinants : 

a) directly from the distribution of matrices, and using the decomposition method 
quoted above, 

b) from the joint distribution of eigenvalues. 

The second way has been used to define the so-called /3-models, see for instance 
Forrester (2007) Chap. 2. The idea of extending the range of (3 to (0,co) is quite 
natural. As mentioned in Section 1, they correspond also to models of tri-diagonal 
random matrices (see Dumitriu and Edelman (2003), Killip and Nenciu (2004), 
Edelman and Sutton (2007)). 
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Because of the connections with orthogonal polynomials in the complex case, the 
extended families are called /3-Laguerre ensemble (or just Laguerre ensemble) in- 
stead of Wishart ensemble and (3- Jacobi ensemble (or just Jacobi ensemble) instead 
of MANOVA or Beta ensemble. 

Throughout, we use the symbol (3' for [3/2 to simplify displays. 



2.2.1. Laguerre. When (3 — 1,2,4 the joint probability density of the eigenvalues 
Xj ■ , j = 1, . . . , r of W on the orthant Xj > 0, j = 1, . . . , r is 



_1_ jj ^ (n-H-D-i e -^) ]-[ | Afe _ A .|^ , ( 2 . 8) 

j — l l<j<k<r 

and the normalizing constant is 

i v p'rn r (i + (3'j) r (p'(n - j + 1)) 



*-<»>-(*) n 



r(i+« 

This is the Selberg integral (see for instance Edelman and Rao (2005), formula 4.6 
and references therein). 

When (3 > is not 1,2,4, we consider the density (2.8) on (0,oo) r . We also 
denote the product -\? by \W\. This gives the Mcllin transform 

¥]w]f 3>s = Zy(n + s) _ / iy' rs fr T(l3'(n-k + l + 3 )) 
11 Z r L ^(n) l = \ T(P'(n-k + l)) ■ 

Remembering that if X == Gamma(a, 1/2) then 

EX , = 2 ,r^_+a) (Ai> _ a)j 

we deduce the following proposition from the uniqueness of Mellin transform. 
Proposition 2.2. We have 

\w\®flfi}£, 



3 = 1 



where the variables pf„,j = 1, . . . ,r are independent and 



pf f ( " } Gamma {(3' {n - j + 1), 1/2) . (2.9) 

We stress that our point of view is not compatible with the construction by 
(Dumitriu and Edelman (2003)) of matrix models for the (general) /3-Laguerre 
ensemble. Actually, they define a random r x r matrix where only diagonal 
and subdiagonal terms are nonzero, independent and satisfy (for n fixed): 

b£ j ( =' V 'Gamma (f3'(n - i + 1), 1/2) (1 < i < r) , 
BQ_j_ = v 'Gamma (/?'(r - i + 1), 1/2) (2 < i < r) . 

They prove that the distribution of eigenvalues of B^ r > is precisely (2.8). 

Of course we recover the determinant as a product of elements with the good 
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distribution, but the problem is that we cannot consider all r simultaneously in 
their framework, since 

(B(r) ( B W)') H1 ^H)( B H))', 

2.2.2. Uniform Gram. It is useful in the study of correlations. A correlation matrix 
is a positive definite matrix with diagonal entries equal to one. Here, there is no 
explicit expression for the law of eigenvalues. However, the expression 

_JL_ | G |/3'(n-r+l)-l 

with 

Z G,f3 ( x _ „/3'r(r-l) TT F W ( n ~ j + X )) 
J = l 

is a density on the space of symmetric (resp. Hermitian, resp. self-dual) positive 
matrices with diagonal entries equal to one, and it fits with the distribution of 
correlation matrix in the real (see (2.1)), complex and quaternion case, for the 
appropriate values of j3. This yields (Gupta and Nagar (2000) ex. 3.26 p. 130) the 
Mcllin transform 

mn p'. = Z^(n + s) = ' r(/3'(n-j + l + s))r03'rc) 
Z?' {n) ji T(f3'(n-j + l))T(f3'(n + s)) 

From (2.3), it is clear that if X == Beta(a, b) then 

M ^rT^ (">-°>- ^ 

r(a)r(a + b + n) 

Again the uniqueness of the Mellin transform leads to the proposition. 
Proposition 2.3. We have 

\G\^f[ P ff 

G 3 

where the variables p-^,j = 2,...,r are independent and 

pff ( = } Beta((3'(n j + 1), /?'(.? - 1)) • (2.11) 

2.2.3. Jacobi. If Z is distributed as in (2.2), the joint density of eigenvalues on the 
set (0 < Xj < 1 j = 1 , . . . , r) is given by 

v ' i=l l<i<j<r 

where Z r (ni, 112) is a normalizing constant (see for example Muirhead (1982) The- 
orem 3.3.4). 

For n 2 < r < m, the matrix W 2 is singular, and the Jacobi matrix I — Z has 
1 as an eigenvalue with multiplicity r — n 2 . The distribution of Z has no density. 
Nevertheless we may study its determinant. Indeed, the matrix I — Z has as 
an eigenvalue of multiplicity r — n 2 . Actually the density of the law of the non- 
zero eigenvalues of this matrix is known (see Srivastava (2003) and Diaz-Garcia 
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and Gutierrez Jaimez (1997)), so that the non-one eigenvalues of Z have the joint 
density 

Mm,n 2 ) i=1 i<i<j< n2 

where the normalizing constant is Z r (m, n 2 ) = Z, l2 (ni + n 2 — r, r) . 

We now consider matrices with elements in X = C or H. When r < min(ni, n 2 ), 
the distribution of Z has a density proportional to 

j^Wr+D-l | J _ Z |^ (na - r+ l)-l _ 

where /?' = 1 or 2. The distribution of the eigenvalues of Z has the density (on 
[0,in : 

/^ 1 ,„ 2 (Ai,.-.,A r )= (2.12) 

' (ni,n 2 ) i=1 l<i<j<r 

where 

^ (ni ' n2)= | = l r(l + /3')rG9'( ni +n 2 +j-r)) ' (2 " 13) 

is the value of the Selberg integral (see Hiai and Petz (2000) p. 118 and also Edelman 
and Rao (2005) p. 19 and references therein). 

In the "singular" case (n 2 < r < ni), the density of the non-one eigenvalues is 

/ri2,ni+n 2 -r,r(Alj ■ ■ ■ i ^112)' 

We consider an extension of the above models. For every (3 > 0, we define a 
family of distribution densities f£ ni) „ 2 on [0, i] min ("2,r) . 



F /3 _ j /^m,n a if r ^ min(m,n 2 ) 

r,rti,77 

We set by convention 



l/n 2 ,ni+n2-r,r lf n 2 < r < Til . 



min(n 2 ,r) 
\Z nij7 i 2j r\ |~^|" Aj 

i=l 

in all cases, and we call it the determinant, even if we do not define any matrix. 
For r < ni,n 2 , using (2.12) and (2.13) we obtain 

(m + s, n 2 ) 



E 



^|-2 rll! n 2jr | /3 ^ 



^ J,/3 (ni,n 2 ) 



^ r (/?'(m + n 2 + j - rj) r (g>i + j - r + 5)) 

11 r (/3'(ni + j - r)) T (/3'(m + n 2 + j - r + «)) ' 1 ' J 

If n 2 < r < m we start directly from (2.14) and (2.13) we have 
Z nf (n 1 +n 2 -r + s, r) 



E 



(\Z \P' S \ - 



-Z^ («i + "2 - r, r) 
11 T (/3'(m + j - r)) T (/3'(m + j + a)) ' 1 ' > 
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Multiplying up and down by nfe=n 2 +i ^ (P'( n i + k — r))T (/3'(ni + k — r + s)) we 
get again the right hand side of (2.15). Going back to (2.10), we have the proposition 



Proposition 2.4. For r < m : 



\z I ( - } TT 



Pi 



j,m,ri2 ' 

w/iere p^'^ n2 , j = 1, . . . ,r are independent and 

pfLn, = Beta(P'( ni -j + l),/3'n 2 ) . (2.17) 

2.3. Processes. In the three ensembles defined above, we have met arrays of in- 
dependent variables with remarkable distributions. In Section 2.1, we have dis- 
cussed the interest of studying all values of r simultaneously in the matrix cases 
(/3 = 1, 2, 4). Since the structure remains the same in the /3-models, it is meaningful 
to consider the processes (indexed by r) of partial sums. A now classical asymptotic 
regime is n, r — > oo with fixed ratio in the Laguerre and Uniform Gram case, and 
m,n2,r — > oo with fixed ratios in the Jacobi case. It means that we consider the 
asymptotic behavior determinants in a dynamic (or path wise) way. 
For the Laguerre model, we define 

logA^:=5>g^ (p<n) (2.18) 
fc=i pn 

and the process 

A^(t):=A£f ntJ) t€[0,l]. (2.19) 
For the Uniform Gram model, we define 

logA^:=f]log^;f (p<„) (2.20) 
k=i 

and the process 

A^(t):=A££ tJ , t€[0,l]. (2.21) 

For the Jacobi model, we fix n and r 2 > 0, set m = L nr iJi n 2 = L nr 2j > an d 
define 

log A^ = £ logp^ i;n2 (p < m) (2.22) 
fc=i 

and the process 

A^(i) = A^ ntJ , te[0,n]. (2.23) 

There are some connections between the above processes. For instance, in the 
real matrix model (/? = 1) we saw in (2.6) that 

LA G.l M, i|2 

that the two random variables in the right hand side arc independent and ||fej|| 2 = 
Gamma(n/2, 1/2). 

To see these connections in the general case, we use the so-called "beta-gamma" 
algebra that will be really helpful in the sequel. Details can be found in Chaumont 
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and Yor (2003) pp. 93-94. In the following relation, 7(a) denotes a random vari- 
able with distribution Gammafa, 1), and (3(a,b) denotes a random variable with 
distribution Beta(a, b). The relation is 

(7(0), 7(6)) = (0(a,bh(a + b),(l-P(a,b)) 7 (a + b)), (2.24) 

where, on the left hand side the random variables 7(a) and 7(6) are independent and 
on the right hand side the random variables (3(a,b) and 7(0 + b) are independent. 
It entails in particular 

7(H) ® P(a,b). (2.25) 



7(a) +7(6) 

Let us note that this relation can be extended at the matrix variate level. 

From the definitions (2.18) and (2.20) and owing to the equalities in distribution 
(2.9) and (2.11), we have then 

logA^logA^ + S„, (2.26) 
where S n is independent of log A^, and specified by 

L«*J 

S n (t) = J2^gei n) , te[0,l] (2.27) 

fc=i 

where k = 1, . . . , n are independent and satisfy = Gamma (fi'n, (i'n) . In 
the sequel, we begin by setting the claims for the Uniform Gram process and then 
deduce the corresponding results for the Laguerrc process. 

Using the definitions (2.18) and (2.22) and the equalities in distribution (2.9) 
and (2.17), we get, by another application of (2.24) 



log A^ r ® log A# + log A L n f +n2 r r log , (2.28) 

fb\ ~\~ ''2 

where this equality holds for all indices r = 1, . . . , m simultaneously, and the two 
processes log and log A^f +n2 are independent. 

It allows to deduce asymptotic results for the Jacobi model from those of the 
Laguerre model. 



3. Main results 

In this section, we state first a law of large numbers and fluctuations for our 
three models, and then the corresponding LDP for processes and marginals. 

Let D T = {v e D([0,T]) : v(0) = 0} the set of cadlag functions on [0,T] and 
D = {v e D([0, 1)) : v(0) = 0} the set of cadlag functions on [0,T] and [0,1), 
respectively, starting from 0. 

We use often the following entropy function 



J(u) = { 



u log u — u + 1 ifu>0 
1 if u = 

+00 if u < 



(3.1) 
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and its primitive: 

/"* t 2 3t 2 

F(t) = J J(u) du = -log*- — +t, (t>0). 

We use also the function defined in Hiai and Petz (2006), for s, t > 0: 



(3.2) 



B(s,t) :-- 



(1 + *) 



(1 + ^) 2 



I0g(l + S) - y l0gS + 



log(l+t)-yk>gt 



^^log(2 + S + t) + ii±l±^lo g (l + s + t). (3.3) 



which may also be written as 

B(s, t) = F{1 + s) - F{s) + F(l + i) - F(t) - F(2 + s + 1) + F(l + s + 1) - 7 - . 
3.1. Law of large numbers and fluctuations. 

3.1.1. Uniform Gram ensemble. Let us define a drift and a diffusion coefficient by 
Theorem 3.1. 



(3 \2 0J 1-t 
(1) As n — > oo, 



/ 2* 



(3.4) 



lim sup 



0. 



±E logA^ + j(l-^) 
(2) for every t € [0, 1), asmoo, 

ElogA^(*) + nJ (l-^) - J*d G ^(a)da 



and 



where 



ElogA^(l)+n + Q-I^logn^^, 



^:=-Iog(27r) 



1-7 

/3 



o e^/2 - 1 



ds . 



(3.5) 



(3.6) 



(3.7) 



(3.8) 



and 7 = — r'(l) is the Euler constant. 
(3) For every t <E [0, 1), as n — > oo 7 



ds 



Var logA^(l)-|logn - J , 



where 



(3.9) 
(3.10) 

(3.11) 
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(4) As n — > oo, 



lim sup 

™ te[o,i] 



logA^(t) 



+ J(l-t) 



= 0. 



(3.12) 



in probability. 



For /? = 1, the formulae (3.7) and (3.10) are due to Abbott and Mulders (2001) 
(see their lemmas 4.2 and 4.4), using a variant of the decomposition method. 

Theorem 3.2. (1) Let for n>l 

v ^(t):=logA^(t)+nj(l-^) , te[0,l). 
Then as n —> oo 

'r,™(t); te[0,lj)=>(x™; 4 6(0,1)), (3.13) 



w/iere AT G ' /3 is ifte (Gaussian) diffusion solution of the stochastic differential 
equation : 



dX?' 13 = d G ^(t) dt + a G ' (t) dB t , 



(3.14) 



G Q 

with X^ ,tJ = 0, B is a standard Brownian motion and => stands for the weak 
convergence of distributions in D endowed with the Skorokhod topology. 
(2) Let 



€' = 



logA£./»(l) +„+(i-i) log n 



llogn 



Then as n — > oo, rj '^ => N where N is A/(0, 1) and independent of B, 
(and stands for the weak convergence of distribution in R). 



3.1.2. Laguerre ensemble. Let us define a drift and a diffusion coefficient by 

1 1\ 1 



d L ^(i) : 
Theorem 3.3. 



2 0J 1-t 
(1) is m CO, 



, <^(t):= 



/3(1 - t) 



(3.15) 



lim sup 

n p<n 



= 



I E logA« + j(l-£) 

(2) For every t G [0, 1), as n — > oo, 

E log Ay\t) + nj (l - - jf* d^(s) ds : 



(3.16) 



(3.17) 



and 



E log A^(l) + n +(i-Vj] og n^K 1 p , (3.18) 
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(3) For every t € [0, 1), as n — > oo, 

Var logA^(t) -» (a Lf3 {s)^ ds (3.19) 



Var log A^ /3 (l)- -log n -» #J . (3.20) 



2 

(4) yls n — > oo, 

ilogA^(t) + J(l-t) 



sup 

t€[0,l] 



(3.21) 



zn probability. 



Remark 3.4. In the Uniform Gram and Laguerre ensembles, when all the variables 
are defined on the same space (i.e. (3 = 1, 2, 4), an application of the Borel-Cantelli 
lemma leads to almost sure convergence. 

Theorem 3.5. Let 

rj^(t) := logA^+n^l-M) , t e [0) l) , 

logA^(l) + n+(±-±) logn 



rfc'f* - 

yjj lo S' 
T/ien as n — > oo 

(f?^(t); te[o,i)) =* (a^, te[0,l)) (3.22) 

w/iere X L 'P is the Gaussian diffusion solution of the stochastic differential equation: 

dX^'P = d L,/3 (i) dt + o- L 'P(t) dB t , (3.23) 

with = 0, where B is a standard Brownian motion and N is Af(Q, 1) and 

independent o/B. 

The convergence of n^it), for fixed t and ofr}^' 1 were proved by Jonsson (1982) 
Theorem 5.1a. Recently and independently the convergence of rf^' 1 was proved in 
Theorem 4 of Rempala and Wesolowski (2005). 

3.1.3. Jacobi ensemble. In this part we use new auxiliary functions. Let 

£(x, y, z) = a; log a; - (x + y) log(a; + y) + (x + y-z) log(x + y- z) -(x-z) log(x-z) 

or using J defined in (3.1) 

£{x, y, z) = J(x) - J{x - z) - J(x + y) + J(x + y-z). (3.24) 
The partial derivative of £ with respect to x is : 

&(,,,,.) =-^ g («, y ,,)-V« ^±g^ . (3.25) 



Let for < t < n 



a 2 (t):=^-£ 1 (r u r 2 ,t) = - -. (3.26) 

at (n -t){Ti +t 2 -t) 
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Again wc define drift and diffusion coefficients: 

d W (t)=Q-^)o 3 (t) , a^(t) = ] lja(t). 
Theorem 3.6. (1) As n — > oo, 

sup -El0gA^(t)-£(Ti,T 2 ,i) -»0. 
t€[0,Ti] n 

(2) for ewer?/ t G [0,Ti) ; as n ^ oo, 

EIogA^(t)-£(LTinJ,L75nJ,LtnJ)— » / dj(«) ds, 

Jo 



(3.27) 



(3.28) 



and 1 

ElogA^(Ti)-£(LrinJ, LnnJ) 



1 1 

P ~ 2 



logn 



1 1 



2 /3 / ° n + r 2 



/3 • 



(3) for ewerj/ i £ [0, ri), as n — > oo, 



Var logA^(r) -» J (cr J ^(s)) 2 da, 



and 



Var log A J J( Tl ) - \ logn — > | log (-^-) + «| . 

V Tl + To / 



(4) ^4s n — > oo, 



sup 

te[o,ri] 



-logA^^-^n,^,*) 



(3.29) 

(3.30) 
(3.31) 
(3.32) 



in probability. 



Remark 3.7. For j3 = 1,2,4, when all variables are on the same probability space, 
the convergence in (4) may be strengthened to almost sure convergence. 

Theorem 3.8. Let for n > 1 

n^(t) := IogA^(t)-f(LTinJ,L75nJ,L*nJ) , *e[0,n), 
logA^n) - (ti.tu.ti) + (i - i) logn 



Then as n — > oo 



(n^(r); ie [0,n)) =► (X/; te [0,n)), (3.33) 



J,/3 



AT 



where X J '@ is the (Gaussian) diffusion solution of the stochastic differential equa- 
tion : 



dX( = d J ' {t)dt + <j J ' {t) dB t , 



(3.34) 



^^where Kg (rosp. Kg) was defined in (3.8) (resp. (3.11)) 
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with = 0, B is a standard Brownian motion and N is a standard normal 

variable independent o/B. 

3.2. Large deviations. All along this section, we use the notation of Dembo and 
Zeitouni (1998). In particular we write LDP for Large Deviation Principle. The 
reader may have some interest in consulting Dctte and Gamboa (2007) where a 
similar method is used for a different model, but here we use a slightly different 
topology to be able to catch the marginals in T. 

For T < 1, let My be the set of signed measures on [0, T] and let M< be the set 
of measures whose support is a compact subset of [0, 1). 

We provide D with the weakened topology a(D,M < ). So, D is the projective 
limit of the family, indexed by T < 1 of topological spaces (Dt, o{Dt, My)). 

Let Vi (resp. V r ) be the space of left (resp. right) continuous R-valued functions 
with bounded variations. We put a superscript T to specify the functions on [0, T]. 
There is a bijective correspondence between Vj and My : 

- for any v € Vj , there exists a unique \i € My such that v = fj,([0, •]); we denote 
it by v , 

- for any /i G M-r, v = fi([0, •]) stands in V r . 

For v e D, let v = v a + v s be the Lebesgue decomposition of the measure v € 
M([0, 1)) in absolutely continuous and singular parts with respect to the Lebesgue 
measure and let /i be any bounded positive measure dominating v s . 

For A C [0, 1) and v e D let 

Ia(v) = J L a (t,^(t))dt + J L a (t,^(t))dii(t) if vGV r , (3.35) 

and Ia{v) = oo if v e D \ V r , where functions L a (t, x) and L s (t, x) will be defined 
later for each of the ensembles of interest. 



3.2.1. Uniform Gram ensemble. For the following statement, we need some nota- 
tion. Let H be the entropy function : 

x 1 — x 

~R{x\p) = x log - + (1 - x) log y— , 

and put 1 

L°(t,y) = H(l-t|e»)<y(tf|(-oo,0)), 

Lf(t,y) = -(l-t)y5(y\{-oo,0)). (3.36) 

Theorem 3.9. The sequence {n^ 1 logA^ ,,3 (t), t e [0, 1)}„ satisfies a LDP in 
(D, a(D, M<)) in the scale 2/3~ 1 n~ 2 with good rate function I^iy 

That means, roughly speaking, that 

P(logA^ ~ nv) w e-^ 7 W) w . 
The proof, in Section 6.1, needs several steps. Let 9^ = n~ x log A^, so that 

®n=-E( l0 g"n/)^/n. (3-37) 



we set 6(y\A) = if y £ A and = oo if y f A 



18 



Alain Rouault 



First we show that {6^} satisfies a LDP in My equipped with the topology 
<j(Mt,V(). Then we carry the LDP to (Dt,ct(Dt,Mt)) with good rate function: 

few - i TI t «))* + i [m <<. ■ <« 8 » 

To end the proof we apply the Dawson-Gartner theorem on projective limits (Dcmbo 
and Zeitouni (1998) Theorem 4.6.1, see also Leonard (2000) Proposition A2). 
Let us note that Jjp T j (v) vanishes only when v satisfies (essentially) 

§«) = log(l-«) , f«) = 0, (3.39) 

i.e. for v(t) = —J{1 — t), which is consistent with the result (3.12). 

The LDP for marginals is given in the following theorem, where a rate function 
with affine part appears. 

Theorem 3.10. For every T < 1, the sequence {n^ 1 log A^(T)} satisfies a 
LDP in R in the scale 2[3~ 1 n~ 2 with good rate function denoted by 

/£(0=ini{/£ T] ( W ) ; v(T)=£}. (3.40) 

(1) If£ e [-T,0) the equation 

J(l + 9) — J(l — T + 9) — T log(l + 6)=£, (3.41) 
has a unique solution, and we have 

It(0 = ^ + TJ(l + 9) (3.42) 
+ {F(l)-F(l-T)-F(l + 9) + F{l-T + 9)) . 

(2) If£< -T, we have 

I?(0=lT(-T)-(l-T)(t + T). (3.43) 

(3) J/£>0, J^(0=oo. 

3.2.2. Laguerre ensemble. Let 

Lj(t,y) - ( e y-l)-(l-t)y + J(l-t) 

L L s {t,y) = -(l-^^K-oo.O)). (3.44) 



Theorem 3.11. TTie sequence {n 1 log A^'P(t), t £ [0, 1)}„ satisfies a LDP in 
(D, a(D, M<)), in i/ie scale 2/3~ 1 n~ 2 with good rate function 1^ ^. 

That means, roughly speaking, that 

P(logA£ ~ nv) w e _! T f Ri)(») . 

The proof uses the above result for the Uniform Gram process and the beta-gamma 
algebra. 

Let us note that Ife T j (v) vanishes only when v satisfies (3.39) (again) i.e. for 
v(t) = —J(l — t), which is consistent with the result (3.21). 

The LDP for marginals is given in the following theorem. 
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Theorem 3.12. For every T < 1, the sequence {n 1 log A^ ,/3 (T)}„ satisfies a 
LDP in R in the scale 2(3~ 1 n~ 2 with good rate function denoted by 1^. 

4(0=mi{I^ T] (v) ■ v{T)=£}. (3.45) 

(1) If£,>£,T ■■= J{T) - 1 the equation 

J(l + 6)-J(l-T + 9) =<£. (3.46) 
has a unique solution, and we have 
4(0 =0£ + F(l) - F(l -T) — F(l +0) + F(l — T + 6) . (3.47) 

(2) If £, < £t , we have 

IZ(Q = IZ(T) + (1-T)(£ T -Q. (3.48) 

3.2.3. Jacobi ensemble. In this subsection, the endpoint 1 of subsections 3.2.1 and 
3.2.2 is replaced by n. 
Let, for t < ri, 

L J a (t,y) = ( Tl +r 2 -t)H( eA 

V T\ + T 2 — t / 

L J s (t iy ) = -(Ti-t)yify<0. (3.49) 

Theorem 3.13. TTie sequence {n~ x log A^(t), i e [0,ti)}„ satisfies a LDP in 
(D,o-(D,M < )) in the scale 2(5~ 1 n~ 2 with good rate function I[o, Tl )- 

That means, roughly speaking, that 

P(log ~ nv) ss e"T f [»,»W . 

Let us note that T j (u) vanishes only when v satisfies (essentially) 

dv a n-t dv s 

at T\+T 2 — t d[i 

i.e. for v(t) = F,(Ti,T 2} t), which is consistent with the result (3.32). 
The LDP for marginals is given in the following theorem. 

Theorem 3.14. Let T e [0,n), and & = J(t 2 ) + J{T) - J(T + r 2 ) - 1. 

(1) The sequence {n^ 1 log A^^(T)}„ satisfies a LDP in K in the scale 2/3~ 1 n~ 2 
with good rate function 1^ where 

4(0 :=mi{lf 0iTl (v) ; v(T) = £} . (3.50) 

(2) If £ e [6^,0), the equation 

£(e + T U T*,T) = S (3.51) 
has a unique solution 6 > T — t\, and we have 
4(0 = H - [F(6 + n)-F(6 + T 1 -T)}-[F(T 1 +T 2 )-F(T 1 +T2-t)} 
+ [F(t\) — F(n — T)] 

+ [F{6 + n + r 2 ) - F(0 + n+T 2 - T)\ . (3.52) 

(3) If £ < we have 

4(0 = 4(4) + (4 - 0(n - r) . (3.53) 

(4) lf£>0, then = oo. 
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4. Connections with the spectral method 

The logarithm of the determinant of a non singular matrix is a linear statistic 
of the empirical distribution of its eigenvalues, so that we may compare the above 
result with those obtained by this spectral approach. 



4.1. Laguerre/Wishart. We start with 

1 r f 

-logA^f = - / (logs) dfj, nir (x) 

n n J 

where [i n ,r the so called empirical spectral distribution (ESD) is 

Jl>* (4-1) 



r 

fe=i 



For c > and cr > 0, let 7r£ 2 be the distribution on R defined by 



1/2 



< 2 (ds) = (1 - c-XSoidx) + - ' \ dx, (4.2) 

Zttct cx 

where 5q is the Dirac mass in 0, x + — max(i, 0) and 

a(c) = (l-^) 2 , 6(c) = (l + ^) 2 . (4.3) 

It is called the Marcenko-Pastur distribution with ratio index c and scale index a 2 
(Bai (1999) p.621). 

It is well known (Marcenko and Pastur (1967), Bai (1999) section 2.1.2 for the 
cases = 1 and = 2) that as n,r — > oo with r/n — > T e (0, oo), the family of 
ESD (/i n , r ) converges a.s. weakly to 7rf . If we replace the common law J\f(0, 1) by 
Af(0, cr 2 ) then the limiting distribution is 7rJ 2 - 

To conclude that 

lim J (logs) d[i n ^(x) = j (\ogx)dnl(x) , (4.4) 

an additional control is necessary, since x logx is not bounded. 

Actually, the largest and the smallest eigenvalue converge a.s. to b(T) < oo and 
a(T) > 0, respectively. For comments on these results and references, one may 
consult Bai (1999) sections 2.1.2 and 2.2.2., (see also Johnstone (2001)). In our 
context, this implies easily that a.s. 

- log A^f = - f (logs) 4vr(s) -> T [ logs dirf (s) (4.5) 
n n J J 

Moreover, it is known (Jonsson (1982) p. 31 and Bai and Silverstein (2004) p.596- 
597) that : 

/MT) i 
(logs) dvl (s) = / ^ ^(x-a(T))(b(T)~x)dx 
Ja(T) 27TS 

= (T-l)log(l-T)-T=-J(l-T) (4.6) 

which implies that claim (4.5) is consistent with (3.21). 

Recently, Bai and Silverstein (2004) proved a CLT for linear statistics of sample 
covariance matrices (non necessarily Gaussian), with the meaningful example of 
determinants. They consider the real and complex case, and their results (Theorem 
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1.1 ii) and iii) are consistent with the marginal version of (3.22). It is likely that 
[3 = 4 can also be handled under their assumptions. 

Let us end with the large deviations. Hiai and Petz (1998), (see also Hiai and 
Petz (2000) section 5.5) proved 2 that if n — > oo and r/n — > T < 1, then {/i n ,r} 
satisfies a LDP in Mi([0, oo)) in the scale 2/3~ 1 n~ 2 with some explicit good rate 
function I^ L given below in (4.8, 4.9, 4.10). If the contraction \i ^ Jlogx dfj,(x) 
were continuous, we would claim that {n _1 log ^ nT j} n satisfies a LDP in R in 
the same scale, with good rate function 



#(0 



inf j/* pi (u) ;T Jlogx d»(x) = . (4.7) 



Actually, 



I<? L (H) = -T 2 S( M ) + T J {x - (1 - T) logs) dfj,(x) + 2B(T) (4.8) 

where 

S(M) := j f log \x - y\ d^x)dfi(y) (4.9) 

is the so-called logarithmic entropy and for T € (0, 1) 

2B(T) = -i (3T - T 2 logT + (1 - T) 2 log(l - T)) . (4.10) 

We do not know if the contraction ^ Jlogx d/z(x) does work, although not 
continuous. However we will prove the following result, where for u e R we put 

A{u) = : J (log a;) d^(ai) = u} . (4.11) 

Proposition 4.1. For £ > £ T 6* solution of (3.46), let a 2 = I + 8. Then the 
infimum ofI^ >L (fi) over A(t;/T) is uniquely achieved for i^i" and 

It® = It VL ^) = mf{/T PL (M); M e A(£/T)} . (4.12) 

Remark 4.2. (1) The endpoint is £ T = J (T) - 1, with a 2 = T. 

(2) For £ < £t we do not know what happens. We can imagine that the 
infimum in (4.12) has a solution in some extended space. 

4.2. Uniform Gram. Let Afe,fc = l,...,r be the (real) eigenvalues of G in the 
Uniform Gram ensemble, and set 

~^r = \Y,h k - (4-13) 

fe=l 

For /? = 1, De Cock et al. (1999) proved that, as n — > oo and r/n — > T e (0, oo), 
the family (/U„ ;T .) converges a.s. to 7tf . More recently Jiang (2004) proved that 
the same result holds true in a complex Gram ensemble not necessarily uniform. 
Again, like in Section 4.1, we may write 



-iogAn'f = - / (logs) d/J, ntr (x) 

n n J 



2 Their j3 is our fi 1 . 
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and use the weak convergence of j2 n ,r towards nf . Recently, Jiang (2004) proved 
that the largest and the smallest eigenvalue converge a.s. as r/n — > T < 1 to 
b(T) < oo and a(T) > respectively. So, we have 

lim J (logx) djl ni r(x) = J (logx)dirJ(x) . (4.14) 

In view of (4.6), this matches with the result (3.12). 

No result on fluctuations or large deviations seems to be known on /U„ >r . 

4.3. Jacobi. In the matrix models (f3 = 1,2 or 4), take r < ri\ and let Xk,k = 
1, . . . , r be the eigenvalues of £ ni ,n 2 ,r (they are real nonnegative) . The ESD is 



1 r 

V ni ,n 2 ,r = — ^Afc • 
fe=l 

When ii2 < r < Hi we have 

-V.L x. (\ _ 1^ a 

r V r / 

where /x nij „ 2jr is the ESD built with eigenvalues different from 1. We can write in 
all cases 

log A^f = min(r,n 2 ) J (logx) fj, nun2 ,r(dx) . (4-15) 

It is then possible to carry asymptotical results of this empirical distribution to 
logA#. 

Capitaine and Casalis (2004) studied the complex case in the asymptotical regime 
n\jr — > u', 112/r — » f' with w' + 1>' > 1. They prove 3 that Ei/„ ljn2jr converges (in 
moments hence) in distribution. To give the expression of the limiting distribution, 
which we denote CC u ' iV / and to compare with known results in some other contexts 
with coherent notation, we will use in the following, four functions : 
for (b,c) e (0,1) x (0,1) we put 



*±(b,c) = ±[l + Vte±y/(l-b)(l-cj\ , (4.16) 



and for (x,y) e (0, 1) x (0, 1) 



a±(x,y) = (l-x-y + 2xy) ± 2^ x(l - x)y(l - y) 

= (y/(l-x)(l-y)± y /xif . (4.17) 
The mappings a± and a± are inverse in the following sense : 

{(b,c) : < b < c < 1} {( x ,y) : < a: < y < 1 and i + 1/ > 1} (4.18) 

(o-,o+) 

For < a_ < a+ < 1, let 7r a _ ia+ be the distribution on K defined by 

ir,_, = C a _, a+ ^ (g 2 ~ "ff^ -^ 1 [a-a + ](») (4-19) 



They use the notation a and /3 but we change not to confuse with f} already defined. 
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where C a _. a + is the normalization constant. Since we found some mistakes in 
the literature, let us compute explicitly the constant C a _, a+ - From the obvious 
decomposition 



1 1 

+ 



x(l — x) x 1 — x 
we get 

(C a _, a+ )" 1 = I(a-,a + ) + 1(1 - o+, 1 - o_) 
where, for < u < v 



[ v y/Jx -u){v- x) 

I(u,v) — I — ax 

V ' Ju 2nx 

This last integral could be calculated by elementary method, but it is shorter to 
connect it with the Marcenko-Pastur distribution. Taking 



<y = : , V c = - — — — 



4 ' Vu 
the simple fact that 7r£ 2 , given in (4.2), is a probability distribution yields 



I(u,v) = ^ v ' (0<u<v). 



Finally, we get: 



(Ca^a+r 1 = g [1 - V^T-v/(l- «-)(!- «+)J • (4-20) 
The distribution CC u < jt ,< is then (recall m' + v' > 1) : 

CC„/,„- := (1 - w') + 5o + (l-v')+S 1 (4.21) 

+ [l-(l- U ') + -(l-«') + ]^_,a + , 

where 

(a_,ffl+) = a ± [ " , ,1 — - ] . (4.22) 

Remark 4.3. The case (u' < 1) corresponds to r > ri2, the second matrix W2 is 
singular and the case (v' > 1) corresponds to r < n 2 , the second matrix is non- 
singular. 

For particular values of the parameters and up to an afHne change to make the 
distribution symmetric, the distribution 7r a _. a+ was introduced by Kesten (1959) as 
limit distribution for random walks on some classical groups. It was (independently) 
introduced by McKay (1981) as a limit distribution in a graph problem. It is 
sometimes called the generalized McKay distribution. Some important connections 
are in Section 9. 

For the LLN, the same remarks as above are relevant. Let us recall the notation 

r < m , n^oo, > T , ► n , > r 2 , u = — , w = — . 

n n n 11 

The weak convergence of the ESD (Capitaine and Casalis (2004)) and the control 
on the extremal eigenvalues (Ledoux (2004), Collins (2005) and references therein), 
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yield, if u 1 > 1 

lim-logA^(T) = j{\ogx)CC u , y {dx) 
n r J 

i(v',l) J (logx)ir a _, a+ (dx) (4.23) 



mm 



where a± are in (4.22). Nevertheless a computation of this integral by elementary 
methods is not so easy. After some attempts, we choose to consider the above result 
as an indirect way to compute this integral and we obtain the following result. 



/< 



Proposition 4.4. For < a_ < a + < 1, 

(logs;) ir a _, a+ (dx) = (4.24) 
_ a + log g + + a- logcr- - (cr+ + (J- - 1) log(c+ + <T- - 1) 

where a± are specified by (4-16). 
Proof: From (3.32), 

limilogA^(T) = ihmilogA^(T) = ± £(r u r 2 , T) = £(u', v',1) , 

n r Inn 1 

where for the last equality we noticed that £ is homogenous. With the help of 
(4.23) we get 

min(t/,l) J{logx)TT a _, a+ {dx)=£(u',v',l). (4.25) 
From (4.18) we see that if v! > 1 then 

v! u' + v' - 1 ' 



u' + v' ' tt' + v' 



We have two cases. When v' > 1, 



w' it' + - 1 



u' + w' u' + w' 

so that (4.25) yields 

(loga;)7r a _. a+ (dx) = £(u',v', 1) =£(-^—, l) (4.26) 

VI — <7_|_ 1 — (7+ / 

When v' < 1, 



u' + u' - 1 



(lo g2 ;)7r a _, a+ (dx) = ±j£(u',v',l) = e(-^—,\-^,\) (4.27) 

V 1 — (7+ V 1 — <7_ 1 — <7_ / 



?i' + w' u' + l/ 

so that (4.25) yields 

1 f/ , , 1 — er_ ^ cr_|_ 1 — a + 

and together (4.26-4.27) provide (4.24). This ends the proof. 

Let us remark that the first case above (v' < 1) corresponds to T > t 2 (i.e. 
r > n 2 , the second matrix VF2 is singular) and the second one (v' > 1) corresponds 
to T < r 2 (i.e. r < n 2 , the second matrix is non-singular). □ 

Let us end with the large deviations. In the complex case (/? = 2), Hiai and Petz 
(2006) proved that if n — ► 00, n\/n — ► Ti,n 2 /n — > r 2 > T\,r/n — > T < n, then 
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{A i ni,n2,r}n satisfies a LDP in A4i([0, 1]) the set of probability measures on [0,1] 
endowed with the weak convergence topology, in the scale n -2 , with the good rate 
function 

It PJ M ■= -T 2 np) - T ( ((n - T) \ogx + (r 2 - T) log(l - x)) dfi(x) 

Jo 

+ T 2 s(^,^), (4.28) 

where B is defined in (3.3) (it is the limiting free energy). 

A computation similar to p. 10 of Hiai and Petz (2006) gives the same result for 
general [3. 

Proposition 4.5. IfT < n < r 2 , the family {n[ n T 1 \,\nT 2 \\nT\} satisfies a LDP in 
A^i([0, 1]) in the scale 2(3~ 1 n~ 2 and good rate function I^ J . 

If the contraction /jh| logo; d/j,(x) from the set Mi([0, 1]) to R were continuous, 
we would claim that {n -1 log A^(T)}„ satisfies a LDP in R with good rate function 
Ij, where 

7^(0 = inf {l^ J (p) ; M e A^T- 1 )} (4.29) 

with A(u) as defined in (4.11). 

Like in the Laguerre case we will prove the following result. 

Proposition 4.6. Let T < min(-ri,T 2 ), £ £ [£y,0) and 9 solution of (3.51). Then 
the infimum of I^ J (//) over A(^T is uniquely achieved at /i — 7Ta_.a + where 

(a_,a+) = a±(S_,S+) 

n+fl . Ti+6> + r 2 -i 

- s + = - ( 4 - 3 °) 



Tl + T 2 + 9 T!+T 2 + 9 

and 

4(0 = IT W,a + ) - inf{/^ J (M); M G -A^- 1 )} • (4.31) 
Remark 4.7. The endpoint is which corresponds to = T — n, i.e. 

- n - 4t 2 T 

a - = ' a + = 7 7^2 ■ 

(r 2 + T) 2 

For C < we do not know what happens. We can imagine that the infimum in 
(4.29) has a solution in some extended space. 

Remark 4.8. In the range r 2 < T < n we have a similar result, exchanging s_ and 
s+ in (4.30). We omit the details. 

4.4. Extensions. We already mentioned that in the Wishart and Gram models, 
limiting results exist for marginals when we leave the Gaussian/Uniform world, in 
particular for fluctuations in Bai and Silverstein (2004). 

The Bartlett decomposition is not possible in the general case. Nevertheless, a 
product formula for the determinant is well known (sec for example Lemma 3.1 p. 9 
and formula 4.3 p. 15 in Friedland et al. (2004)), but nothing can be said about the 
distribution of the components of the product in general. 
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Nevertheless, if the columns (or the rows) of the matrix B are i.i.d. and isotropic, 
the previous results extend easily. 

Let us begin with the "column" case. The beta-gamma algebra allowed us to pass 
from the Uniform Gram ensemble to the Wishart ensemble. The polar decomposi- 
tion allows to obtain similar results as for the Wishart ensemble under convenient 
assumptions on the radial distribution. Let e n — log ||fei|| 2 — logE||6i|| 2 (remember 
that we omit the dimension index n). To get convergence and fluctuations it is 
enough to assume 

riEe n — ► di , nVare„ — > a 2 , nE(e n — Ee„) 4 — > . (4.32) 

To get large deviations, it would be sufficient to assume that, for some convenient 
functions ip, the quantity n~ 2 Y^l=i logEexp {n<fi(k/n)e n ) has a limit . 

Akhavi (2002) uses the uniform distribution in the unit ball, so that the dis- 
tribution of ||6i|| 2 is Beta(n/2, 1) and (4.32) is satisfied with a\ — —2, a 2 = 0. 
The contribution of the radial part is then roughly "deterministic" since E||6i|| 2 is 
bounded. 

In the "row" case, we can use the results of the "column" case since the eigen- 
values of BB 1 are (except with multiplicity n — r) the same as those of B'B. 



5. Proofs of Theorems of Section 3.1 

5.1. Proof of Theorem 3.1. We will use Mellin transforms and their first two deriva- 
tives at 9 = 0. From the decomposition (2.20) we have 

logE|A«;;r* = 

k=i 

with 

k G J{6) :=logE 

and from (2.11) 

A £f (*) = — k + 1 + 6)) -t[p\n - k + 1)) +£(P'n)-£ {[3'(n + 6)) (5.3) 

where we set 

£(x) = logF(x). 
Proof of 1) and 2) Differentiating once, we get 



E log = £ [* (/?> — j + 1)) — \& (f3'n)} , 
and from Binet formula (8.5), 

E log A£j? = l°g ~F~ + ~p ( H n-r + C • (5-4) 

in which 

1) (p)r = p(p ~ 1) " " " (P ~ r + 1) is the falling factorial 

2) f/o = and H p = 1 + | + • • • + i are the harmonic numbers 

3) the delta term is 

/>oo r 

C = / s/00£[e^' ( "- fe+1)s - e"" s ] ds. (5.5) 



(5.2) 
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Using Binet formula (8.1) twice, we have for r < n 

= -(-• ■ + 5) h «( 1 -;)--r /w[ 



e - {n - r) - e- sn ]ds 



-^('-;)-W('-5)-jC 



-s(n — r) p —sn 



}ds. 



For r ~ n the Stirling formula gives 



log 



(n) n 



-n + - log(27rn) + o(l) , 



The harmonic contribution in (5.4) is 

ff„_ r - JT n = log (l - +o(l) 

as soon asn-r^oo. For r = n, we have 7J — = — l°g n — 7 + • Applying 
the dominated convergence theorem and (8.4), we see that the delta contribution 
satisfies: 



sup 6„ = 6* 



ds , 



and lim n y nt ^ = for t < 1. Gathering all these estimates, and applying again 
the dominated convergence theorem, we get (for n — r —* oo) 

Elog^ = -nj (l - I) + £ + Q - log (l - I) + o(l) , 
and for r = n 

Elog A££ = -n - Q - logn + J$ + o(l) . 



Moreover, for the suprcmum, we have 



sup 



: log a^- log 



(n) r 



sup 



log 



(n) r 



+ 



1 



= O(logn) 
= O(logn) 



so that (3.5), (3.6) and (3.7) are proved. 

3) Taking logarithms in (5.1) and differentiating twice, we get 

r 

Var log = ^ *' (/3'(n - j + 1)) - *'(/3'n) 
and owing to (8.9) 



where 



Moreover 



Var log = ^j(H n - H n _ r ) + e 



n— r+1 



Var log A^ = lff n -l+^ 
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where 



51 = r 3 ( S/(S) + ^) pl^' {n - k+1)S z-"' nS \ ds ■ 
Applying again the dominated convergence theorem and (8.4), we get 



lim St 



f (f/(f) + \) 

e (3>s _ l 



ds 



Using (8.4) and dominated convergence we deduce easily (3.9) and (3.10). 

To prove 4), let us note that since J is uniformly continuous on [0, 1] we have 



lim sup 

n te[o,i] 



J 1 



L»*J 



o. 



so that, owing to (3.5), it is enough to prove that in probability 
sup | log A^f-E log A^f | =o(n). 

l<p<n 

Actually this convergence is a consequence of Doob inequality and of the variance 
estimate Var n _1 A^'^ = 0(n~ 2 \ogn) coming from (3.10). □ 



5.2. Proof of Theorem 3.2. Let us first note that, thanks to the estimations of 
expectations in (3.6) and (3.7), we can reduce the problem to the centered process 
and centered variable : 



5 n (i):=logA^(t)-ElogA^(i) , 6 n = 6 n (l)/y/(2/(3) logn . 

1) We have 5„{t) = J2k^i V^k where 

Vn, k := (log p^)-E(logp^), k<n (5.6) 

is a row- wise independent arrow. To prove (3.13) it is enough to prove the con- 
vergence in distribution in B([0,T]), for every T < 1, of 8 n to a centered Gauss- 
ian process with independent increments, and variance J * ^cr G ^(s)^ ds. To this 

purpose we apply a version of the Lindeberg-Levy-Lyapunov criteria (see Dacunha- 
Castelle and Duflo (1986) Volume II Theorem 7.4.28 , or Jacod and Shiryaev (1987, 
Chap. 3 c)). For t < 1, from (3.9) it is enough to prove that 

L«*J 

lim^E« fe )=0. 



(5.7) 



k=i 



We have from definitions (5.6) and (5.2) 

/3' 4 E(< fe ) = (A^)( 4 )(0) + 3[(A«f)( 2 )(0)] 2 , 
On the one hand, from expression (5.3) 

(<f ) (4) (0) = /3' 4 [* (3) (/3> - k + 1)) - ¥ 3 Hf3'n)} 
and Binet estimates (8.8), (8.9) for q = 4 yield 



(5.8) 



E«f) (4) (°)-^'£ 



fc=l 



fe=l 



(n-k + 1) 3 n 3 



p 1 
< 6 V - 

" ^-(n-jfc + 1) 4 ' 
fc=i v ; 



(5.9) 
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which, for < t < 1 and p = \nt\ yields lim„ ^fe=i ( A n/fc ) (°) = ■ 0n thc 
other hand, 

E[(A^;f)"(0)] 2 < (sup(A^;/)"(0)) E(A«f )"(0) . (5.10) 

fe=i J - p fe=i 

We already know, from (3.9) that 

P'- 2 E( A &f)"(°) - Varlog A^) -» /' (a G ^( S )) 2 da. 
fc=i ^° 

Now since (A^f )"(0) = /? /2 [tf'(/?'(n - k + 1)) - *'(/3'n)] and since *' is non- 
increasing (see (8.8)) we obtain 

sup (A£?)"(0) < f3' 2 *' (f3'(n - [nt\ + 1)) , 

j<lnt\ 

and from (8.9) (again), this term tends to 0. We just checked (5.7), which proves 
that the sequence of processes {d n (t),t € [0, 1)}„ converges to a Gaussian centered 
process W with independent increments and the convenient variance. It is now 
straightforward to get equation (3.14). 

2) When t = 1, most of the sums studied above explode when n tends to infinity 
and we need a renormalisation. In fact, for every n, the process (S n (t),t G [0, 1]) has 
independent increments. The conditional distribution of S n (l), knowing S n (ti) = 
ei, ... , S n (t r ) = e r for t\ < ■ ■ ■ < t r is the same as e r + Y^[ n t r ]+i Vk,n- Formulae 
(3.9) and (3.10) yield 

n 

E(^J = (2//3)logn + 0(l). (5.11) 

[nt r ] + l 

Actually we can apply the Lindeberg theorem (with the criterion of Lyapunov) 
to the triangular array of random variables rjk,n — r)k,n/^/ (2//3) logn with with 
k = [nt r ] + 1, . . . , n. It is enough to prove 

n 

lim^E(^ ; J=0. (5.12) 

fe=l 

We start again with the decomposition (5.8). From the above estimate (5.9), the 
sum Efc=i( A n,f ) (4) (0) is bounded. In (5.10), we have 

n 

E(<* )"(°) = ^'" 2Var ^^(l) 

k=l 

which is equivalent to 2 logn (see (3.10)) and the supremum in (5.10) with p = n 
is bounded. This yields 

n n 

E E (t) = /3' 2 (logn)- 2 ^E« n ) = O(aogn)- 1 ) 
fe=i fe=i 
which proves (5.12). 

Then X^ tr ] +1 rjk,n converges in distribution to Af(0, 1), and the same is true 

for the conditional distribution of S n knowing S n (ti) = £i, . . . , S n (t r ) = e r . Since 
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the limiting distribution does not depend on ei,...,e r , we have proved that S n 
converges in distribution to a random variable which is Af(0, 1) and independent of 
W. □ 



5.3. Proof of Theorems 3.3 and 3.5. It is of course possible to follow the same 
schemes of proof. Actually we prefer, at least for the beginning, exploit the beta- 
gamma algebra and the fundamental relation (2.26). So, for instance 



E 



0'6 



\/3'n) T{[3'n) 



hence 



logE 



.(«) 



p'8 



£((3'(9 + n))-£ {(3'n) - [3' 9 log ((3'n) , 



(5.13) 



which provides estimates for the expectation and the variance. Differentiating once 
and taking 9 = 0, we see that 



Elog4 Il) = * {P'n) - log (fl'n) = -±- 



e- sfi ' n sf( S ) ds 



(see (8.5), (8.4)), which gives 



sup 



np 



O 







(5.14) 



Besides, differentiating (5.13) twice and taking 9 — again, we have 

Var(log4"))=*'(/3'n) = -^+oQ 
(see (8.9)), which yields 



sup 

p<n 



Var S n p — 

pn 



(5.15) 



From (5.14) and (5.15) it is easy to check (via a fourth moment estimate) that S n 
converges in distribution in B([0, 1]) to 



(-(t//5) + V^B t , te[o,i]) 



where B is a Brownian motion independent of (A„ >" , n G 



Finally the family of 



processes A^'^ 3 = + S n converges in distribution towards 

(x^ - (t/f3) + y/2/0Bt, t e [0, 1)) . 
It is a Gaussian process, whose drift and variance coefficients are 



which identify the process X L ^ . 
Besides, we have 



57^(1) =C ,/3 (1) + 



5n(l) 

V21ogn ' 
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so that the convergence of ^ ,/3 (l) is clear. Moreover the independence properties 
seen in Theorem 3.2 remain true. □ 



5.4. Proof of Theorem 3.6. Again, we could follow the same schemes as in the Gram 
section. Actually we take again the benefit of beta-gamma algebra. Let us delete 
the superscript (3 for the sake of simplicity. 
Let us recall the equality in law (2.28) 

i L M,„„aJ , ,„„*! -i-- ni 



logA^, r ^ logA^ ;r + logA£ i+ „ 2ir -rlog 



m + n 2 



with independence in the left hand side. 
We deduce easily 

E log Ai r = E log A* , r - E log A* + „ 2 >r + r log ■ 

n\ + ri2 

and 

Var log Ai r = Var log A^ r - Var log A^ 1+ „ 2jr 

The results are now straightforward. We let the proof to the reader. We just note 
that since r/ni — > t/n and r/ (ni + ra 2 ) — > i/ (ri + r 2 ) then 

ElogA^ + mjfl-^) -> J* /Tl d L '?(s)ds 

t/(r 1 +T 2 ) 



f r \ ft/(r 1 +r 2 ) 



hence 



d J./»( t ) = I d i (T) 



n \n/ ti+t 2 \ti+t 2 

In the same vein 

( CT ^) (n)) ri + r 2 (n +t 2 ) ) 



5.5. Proof 0/ Theorem 3.8. Again, it is possible to follow the classical scheme. 
Instead let us look at the situation we are faced to. Put 

C/„ = logA^ r -EA^ r , K=logA£ i+ „ 2ir -ElogA£ i+ „ 2ir , 
W„=logA£ r -ElogA£ r . (5.16) 

so that U n = V n + W n with U n => U and V n => V, where U and V are Gaussian 
processes with independent increments, and V n and W n are independent. Looking 
for instance at characteristic functions, it is clear that W n converges in the sense 
of finite distributions to a Gaussian process with independent increments. Its drift 
and variance are the difference of the corresponding ones. Moreover, since {U n } n 
and {V n } n are tight, {U n - V n }„ is tight. 
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6. Proofs of Theorems of Section 3.2 

6.1. Proof of Theorem 3.9. Recall the notation 9^ = n _1 log A^ ,/3 . As mentioned 
after the statement of the theorem, we are going to prove at first the LDP for the 
restriction of 9^ to [0, T], viewed as an element of My, in the scale /?' -1 n~ 2 with 
rate function 

f ,T] (m) := £ L° ( t> (t)) dt + £ Lf (t, ^ (*)) d„(t) . (6.1) 

Let Ve be the set of functions from [0, T] to R which are left continuous and have 
bounded variation, and let V£ be its topological dual when Ve is equipped with the 
uniform convergence topology. 

Actually 9^ £ My may be identified with an element of V e * (see Leonard (2000) 
Appendix B): owing to (3.37) its action on ip e Ve is given by 

1 V nT \ 

<6^> : = ^(fc/n)logp££. 

k=l 

The proof of Theorem 3.9 is based on the ideas of Baldi theorem (Dembo and 
Zeitouni (1998) p. 157). The main tool is the normalized cumulant generated func- 
tion (n.c.g.f.) which here takes the form 

<f„Tj i<P) := ^2 l0 § E [ CX P (P' n2 < , ¥> > )] • (6-2) 
Owing to (6.1) we have 

<'&Tj (V) = i E Aft? ( 6 - 3 ) 
^ fc=i 

and from (5.2) it is finite iff ip(k/n) > — (n — k + l)/n for every 1 < fc < [«TJ . 

In Subsection 6.1.1, we prove the convergence of this sequence of n.c.g.f. for a 
large class of functions <p. It will be sufficient, jointly to the variational formula 
given in Subsection 6.1.2 to get the upper bound for compact sets. Then Subsection 
6.1.3 is devoted to exponential tightness, which allows to get the upper bound for 
closed sets. However, since the limiting n.c.g.f. is not defined everywhere, the lower 
bound (for open sets) is more delicate than in Baldi theorem. Actually a careful 
study of exposed points as in Gamboa et al. (1999) is managed in Subsection 6.1.4. 
We end the proof in Subsection 6.1.5. 

6.1.1. Convergence of the n.c.g.f. Let, for t € [0, 1] and 9 > —(1 — t) 

g G (t, 9) := J(l-t + 9)- J(l - t) - J(l + 9) . (6.4) 
Lemma 6.1. If f e Ve satisfies ip(t) + 1 — t > for every t e (0, T], then 

Y T £ «>Tj (f) = A ^T](^) == £ 9 G (t, <p{t)) dt . (6.5) 

Proof: The key point is a convergence of Riemann sums. From (5.2) and (8.1) wc 
have, for every 9 > — , 

A?'i{n6) = 0{n - k + nO) log ( 1 - - + 9 + -) - 0{n - k) log + 

V n n) \ n nJ 

-p'{n -1 + nB) log(l + 9) + R n , k (9) 
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where the quantity 

poo 

Rn.k(O) = / f( S )e^' s 
Jo 



e -/3'(n-k+n0)s _ Q -l3'(n-k)s 



-f3'(n-l+ne)s _ -P's -0'(n-l)s 



ds 
ds 



is bounded by 2 f °° e 13 s f(s) ds. If we set 

$„(t) := {1- t + <p{t)) log (l-t + <p{t) + -) 

-(1 - t) log(l -*+-)- (l- - + log(l + p{t)) 
n \ n l 

then, making 6 = ip(k/n), and adding in k, we get from (6.3) 

|ntj L«*J 
^( £ -,L».j^) - E = - E *» 



/•LntJ/n / 

= / *„( 

Jl/n \ 



[ns\ 



ds + 

n 



n 



On the one hand, since 95 is left continuous, lim„ <f>„ ^-^-^ = g(t,ip(t)) for every 
f € [0,T]. On the other hand the following double inequality holds true: 
*„(i) > (l-i + ^(i))log(l-i + ^(*))-(l-i)log(2-i) 

- (1 + V (t)) log (1 + p(t)) - I log (1 - t + cp(t)) I 
*„(*) < (1 -* + ¥>(*)) log (2 -t + p(t))-(l-t)lQg(l-t) 

- (1 + v>(i)) log (1 + ^(t)) + I log (1 - t + I , 

and with our assumptions on ip, these bounds are both integrable. This allows to 
apply the dominated convergence theorem which ends the proof of Lemma 6.7. □ 

If there exists s < T such that <p{s) < — (1 — s) then for n large enough, 
£-n,\nT\ {<p) = +°° an d we set AS T A<p) =00. In the other cases we do not know 
what happens, but as in Gamboa et al. (1999), we will study the exposed points. 
Before, we need another expression of the dual of A^ T j . 

6.1.2. Variational formula. Let us define Ajj T j(</?) = +00 if ip docs not satisfy the 
assumption of Lemma 6.1. The dual of A^ T j is then 

( A [5,t]) V) = SU P {< -A-[o,t\(<P)} ( 6 - 6 ) 

for v e VI . Mimicking the method of Leonard (2000) p. 112-113, we get 

( A [5,t]) M= sup >-A[5 iT] (^)} (6.7) 

where C is the set of continuous functions from [0, T] into R vanishing at 0. Then 
we apply Theorem 5 of Rockafcllar (1971) and get 



dt + 



dv s 
d\i 



r(t, ---!./!) .//'!') 
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where 



g*(t,y) = sup{Xy-g G (t,X)S(X\(-l, o))} , (6.8) 



and r is the recession function : 

r[t ,y)= lim 

Actually, if y < 0, the supremum is achieved for 

A^^-ji-^) (6 ' 9) 

and we have 

g*{t,y) = X°(t, y)y - g° (t, X G (t, y)) 

= -y(l - t) + (1 - t) log(l - t) + t log t - t log(l - e y ) 

= H(l-t|e w ). (6.10) 

If 2/ > 0, p*(i, y) = oo. The recession is now r(i, y) = — (1 — t)y if y < 0, and = oo 
si y > 0. As a result 

g k {t,y) = L°{t,y) , r(t,y)=Lf(t,y). (6.11) 
So we proved the identification ^A|^ T ^ = 7^ T j (recall (6.1)). 

6.1.3. Exponential tightness. In this paragraph and in Section 6.2 we use the func- 
tion defined for 6 > —(1 — T) by 

7Jf(6>) := / g G (t,d)dt. (6.12) 
Jo 

If V^* is equipped with the topology a(V e * 7 Ve), the set 

£ a := {M G : H[0,T] < a} 

is compact according to the Banach-Alaoglu theorem. Now —6^ is a positive 
measure and its total mass is — 9^([0, T} ) = — S„(T). We have then 

p(e£gB ) = p(e«(T) < -a) . 

Now for 6» < 

P(e£(T) < -a) < e^ 9nao Eexp{n 2 /3 / «©n( T )} 
so that, taking logarithm and applying Lemma 6.1 we get, for 9 e (—(1 — T), 0) 

lim sup -L logP (e£(T) < -a) < 0a + Lg(0) . 

n p n 

It remains to let a — > oo and we have proved the exponential tightness. 
Let us note that the restriction T < 1 is crucial in the above proof. 
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6.1.4. Exposed points. Let 1Z be the set of functions from [0,T] into M which are 
positive, continuous and with bounded variation. Let T be the set of those m € V f * 
(identified with A4t as in Leonard (2000)) which arc absolutely continuous and 
whose density p is such that — p e TZ. Let us prove that such a m is exposed, 
with exposing hyperplane / m (i) = X(t,p(t)) (recall (6.9)). Actually we follow the 
method of Gamboa et al. (1999). For fixed t, g*(t, .) is strictly convex on (— oo, 0) 
so that, if z ^ p(t), we have 

g*(t, p(t)) - g*(t, z) < X(t, P {t)){p(t) - z) . 

Let d£ — l(t)dt + ^ the Lebesgue decomposition of some element £ 6 My such 
that -f[QT](£) < 00 • Taking 2; = Z(t) and integrating, we get 

/ g*(t,p(t)dt- f g*(tj(t))dt< f X(t, p(t))p{t)dt - [ X(t,p{t))l{t) dt 
Jo Jo Jo Jo 

and since £ g*(t,l(t))dt = £ L%(t,l(t))dt < 1$ T] (0 this yields 

T\o,T] (™) - T\o,T\ (£) < / /™ rfm - / fmd£ ■ 
Jo Jo 

The following lemma says that this set of exposed points is rich enough. 

Lemma 6.2. Let m e V r such that I^ T ^(m) < 00. There exists a sequence of 
functions l n € TZ such that 

(1) lim„ l n {t)dt = —m in with the a(V e * 7 Ve) topology, 

(2) lim n T^ T] (-l n (t)dt) = T^ T] {m) . 

Proof: The method may be found in Gamboa et al. (1999) and in Dette and Gamboa 
(2007). The only difference is in the topology because we want to recover marginals. 
We will use the basic inequality which holds for every e < : 

LG(t,v + e)<LC(t,v)-e(l-t) (6.13) 

Let m = m a + m s such that Jjp r j(m) < 00. From (3.38) and (3.36) it is clear that 
— m a and — m s must be positive measures. 

First step We assume that m — —l(t)dt — i] with I G i 1 ([0, T\; dt) and r\ a singular 
positive measure. One can find a sequence of non negative continuous functions h n 
such that h n (t)dt — > r\ for the topology a(V^ 7 Ve). Indeed every function ip £ Ve 
may be written as a difference ipi — ip2 of two increasing functions. There exists 
a unique (positive) measure ot\ such that ipi(t) = a\([t,T]) for every t <G [0, T\. 
Moreover, the function g = ry([0, •]) € V r is non decreasing and may be approached 
by a sequence of continuously derivable and non decreasing functions (g n ) such that 
g n < g- Setting h n ■= g' n and u n = h n {t)dt, the dominated convergence theorem 
gives 



W\>f n >=/ i/„([0,t])ai(dt)-» / r?([0,t]K 
Jo Jo 



(dt) 
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With the same result for ipz we get 

(lM„) = / i/„([0,t])ai(dt) - / i/„([0,t])a 2 (dt) 
Jo Jo 

f iy([0,t])ai(dt) - / iy([0,t])a 2 (dt). 
o Jo 

or lim„(^>, j/„) = (ip, n). On the one hand, the lower semi-continuity of ijfi T j yields 

limmf /g T] + h n (t))dt) > I^ T] (m) . 

On the other hand, integrating (6.13) yields 

I\o,T](-(Kt) + hn{t))dt) < J L°(t,-l(t))dt + J {l-t)h n (t)dt 

- £ La(t, -l{t))dt + J\l - t) V (dt) = I^ T] (m) . 

Second step Let us assume that m = —l(t)dt with I e i 1 ([0, T]; dt) and for every 
n, let us set Z„ = max(Z, 1/n). It is clear that as n —* oo, then Z„ J. Z. On the one 
hand the lower semi-continuity gives 

limmf I^ T] (-l n (t)dt) > I^ T] (-l(t)dt) . 
On the other hand, by integration of inequality (6.13), since l n — I < 1/n 

i^ T] (-i n (t)dt) < i° T] (-i(t)dt) + i . 

It is then possible to reduce the problem to the case of functions bounded below. 

Third step Let us assume that m = —l(t)dt with I e i 1 ([0, T]; dt) and bounded 
below by A > 0. One can find a sequence of continuous functions (h n ) with bounded 
variation such that h n > A/2 for every n and such that h n — ► Z a.e. and in 
^([0, T], dt). We have Zi„(t)dt -> Z(t)di in a(F/, V t ) and since •) is uniformly 

Lipschitz on (— oo, — A/2], say with constant k, we get 

\I^ T] (-h n (t)dt)-I^ T] (-l{t)dt)\ <k J \h n (t) - l(t)\dt ^ . 

Actually, h n € TZ and <p n {t) ■= A(f, —h n {t)) satisfies the assumption of Lemma 6.1 
since 

1 + < Pn(t)~t> 1 _ t e _ A/2 . 

6.1.5. End of the proof of Theorem 3.9. The first step is the upper bound for com- 
pact sets. We use Theorem 4.5.3 b) in Dembo and Zeitouni (1998) and the following 
lemma. 

Lemma 6.3. For every 5 > and m G V/* , there exists <ps fulfilling the conditions 
of Lemma 6. 1 and such that 

j tp s dm - A*f (<ps) > min I^ T ^(m) — 5, S^ 1 . (6.14) 
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The second step is the upper bound for closed sets : we use the exponential 
tightness. The third step is the lower bound for open sets. The method is classical 
(see Dembo and Zeitouni (1998) Theorem 4.5.20 c)), owing to Lemma 6.2. 

To prove Lemma 6.3, we start from the definition (6.6) or (6.7). One can find 
<ps <= V( satisfying (6.14). If ipg does not check assumptions of the lemma we add 
e > to <ps which allows to check them and satisfy (6.14) up to a change of S. □ 

6.2. Proof of Theorem 3.10. We use the contraction from the LDP for paths. Since 
the mapping m m([0, T]) is continuous from D to R, the family 6^(T) satisfies 
the LDP with good rate function specified by (3.40): 

I<f(0=mi{I° T] (v) ; v(T)=£}. 

Since the process 0^ takes its values in (— oo, 0] (remember Hadamard inequality), 
it is clear that = 00 f° r £ > 0- 

Fixing £ < 0, we can look for optimal v. Let 9 > — (1 — T) (playing the role of a 
Lagrange multiplier). 

By the duality property (6.8) 

*■(«.£<«)) a »£e>-*°(t.«>- 

Integrating and using (6.1), (6.11) and (6.5) we get 

lg T] (v) > 6v a ([0,T}) - L<f(6) - ( T (l-t) dv s (t), (6.15) 

Jo 

For every v such that v(T) = £ it turns out that 

^[o.t]( u ) > 0£-L$(6)- f (l-t + 0) di) s {t) > 0£-L£(6>). (6.16) 



Besides, from (6.9) the ordinary differential equation 

x G (tA'(t)) = e 
0(0) = o, 

admits for unique solution in C 1 ([0,T]) 

1 1 ► 0(0; i) := J(l + 6)- J(l-t + 6)-t log(l + 0) . 

Now, since 

l^ T ^-h( 1 -n^) + TT^] >0 

we see that the mapping 9 ^> </>(#; T) is bijective from [—(1 — T), oo) onto [— T, 0). 
Moreover, by duality 



5* (t, i)) = 6^(6, t) - 3 G (t, 0) . 



<9r v ' V 9f 
There are two cases. 

• If £ € [— T, 0), there exists a unique 9^ such that </>(#£, T) — £ (i.e. the relation 
(3.41) is satisfied). For ifi := </>($£ , •), we get from (6.1), (6.11) and (6.12) again 

'[o,T](^) = 

so that realizes the infimum in (3.40). A simple computation ends the proof of 
the first statement of Theorem 3.10. 
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Let us note that at the end point £ = — T, we have 
0f = -(l-T) , ^) = J(T)-J(T-t)-ilogT, (vt)'(t) = log(l - t/T) . 
Finally 

7£(-T) = 2T(l-T) + (F(l)-F(l-T)- J F(T)+T 2 logT) 

r(i -r) r 2 io g r (i-r) 2 io g (i-T) 3 

2 + 2 2 + 4' 

• Let us assume £ = — T — e with e > 0. Plugging # = — (1 — T) in (6.16) yields, 
for every u such that v(T) = £ 

I$, T] (v) > -(1 - T)£ - (-(1 - T)) = (1 - T)e + JG(-T) , 

and this lower bound is achieved by the measure v = {v~ T )'(t)dt — eSrit), since 

f LC(t,(v- T y(t))dt = I<?(T) , [ T (l-t)edS T (t) = (l-T)e. 
Jo Jo 

It remains to look at £ = 0. Taking £ = in (6.16), we get 

/t(0) > -L$(6) 

for every 8 > —(1 — T). Now, from (6.4) and (6.12) we may write 

-L%{6) = j\l - t) log(l -t)dt + j\l + 9) log (l - T ^_) dt 

+ / t\og{\ - t + 6) dt . 
Jo 

When 8 tends to infinity, the second term tends to zero and the third, which is 
bounded above by (T 2 /2) log(l - T + 8) tends to infinity. Finally 7^(0) = oo. 
That ends the proof of the second statement of Theorem 3.10. □ 

Remark 6.4. It is possible to try a direct method to get (3.42), (3.43) using Gartner- 
Ellis' theorem (Dembo and Zeitouni (1998), Theorem 2.3.6). From Lemma 6.1 
the limiting n.c.g.f. of 0^(T) is L'f which is analytic on (—(1 — T),oo). When 
8 I —(1 — T), we have (Lj,)'(6) J. — T. We meet a case of so called non steepness. 
To proceed in that direction we could use the method of time dependent change 
of probability (see Dembo and Zeitouni (1995)). We will not give details here. 
Nevertheless, this approach allows to get one-sided large deviations in the critical 
case T = 1. Actually we get 

lim-LlogP(e£(l) > nx) = -I°{x) 

for a; > —1. The value x = — 1 corresponds to the limit of 0^(1). note that the 
second (right) derivative of 7-p at this point is zero (or equivalently lim(Lf)" (8) = 
oo as 8 I 0) , which is consistent with previous results on variance. I do not know 
the rate of convergence to of P(0^(1) < nx) for x < —1. 
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6.3. Proof of Theorem 3.11 and Theorem 3.12. Again, the three routes are possible 
to tackle the problem of large deviations for determinant of Wishart matrices. A 
direct method would use the cumulant generating function from (5.13) and would 
meet computations similar to those seen in the Uniform Gram case. 

To avoid repetitions, we use the decomposition (2.26), drawing benefit from an 
auxiliary study of S n , r - 

Lemma 6.5. The sequence {n^ 1 S n (t),t € [0, 1)}„ satisfies a LDP in the space 
(D, a(D, M<)) in the scale 2[3~ 1 n~ 2 with good rate function 

J lW»>=/ M 5(f(<>)<* + if (f (*))*«) (6.17) 

where 

L s a (y) = ( e y-y-l) , Lf(y) = -y%|(-oo,0)), (6.18) 
and n is any measure dominating dv s . 

Let us stress that the instantaneous rate functions are time homogeneous and 
then we may write [0, 1] instead of [0, 1). 

6.3.1. Proof of Lemma 6.5. It is a route similar to the proof of Theorem 3.9 in 
Section 6.1 (see also Najim (2002)). We start from (2.27) so that 



Withe help of (5.13) this yields 

n 

logEexp < f3'nS n ,j >= ^ 

k=i 

-n£{[3'n) 

if -f(s) + 1 > for every s € [0,1]. A little computation shows that the limiting 
n.c.g.f. is 

£ s ( 7 ) = fj(l+7(tp, (6.19) 
Jo 

which yields (6.18) by duality (see Rockafellar (1971) again). □ 

6.3.2. Proof of Theorem 3.11. Let 6£ = n^ 1 log A^ f3 . We deduce from Lemma 6.5 
and Theorem 3.9 that the sum 0^ = 9^ + ^S n satisfies a LDP in the same scale 
with good rate function 1^ T j DI^ T j where □ denotes the infimum convolution : 

(fOg)(x) = inf{/(a;i) + g(x 2 ) \ x\ + x 2 = x} . 
The two characteristics of the rate function are then 

L L a = M{L^(v)+L s a (u-v)} 

V 

L L S = inf{L?(t,) +£?(«-„)}. 

V 

which yield (3.44) by an explicit computation. □ 



n 

= ^(log4™ ) )^/„- 

3=1 

^/3'n(l+ 7 (^)))-/3'7Qlog(/3'n) 
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Alternatively, it is possible to sum the two n.c.g.f. ((6.5) and (6.19)) and get the 
rate function by duality. We claim : if 7 (s) + 1 > for every s e [0, 1] 

-LlogEexp</3'n 2 e£,7>^ / L (*,7(*)) dt , (6.20) 
P n Jo 

where 

g L (t, 7 ) = g G (t, 7 ) + J(l + 7 ) = J(l - i + 7 ) - J (I - t) . (6.21) 

6.3.3. Proof of Theorem 3.12. We may either use the contraction <d^ i— > 6^(T) 
or establish a LDP for the marginal S n (T) and then perform an inf-convolution. 
We leave the details of the proof to the reader. We just give the expression of the 
optimal path when it exists. 

For 9 > —(1 — T), the function 

t ~ (f>(9; t) := J(l + 9)-J(l-t + 6). 

is in C 1 ([0,T]) and the mapping 9 (f>(9;T) is bijective from [—(1 — T),oo) onto 
[£r,oo), where £ T = J(T)-1. 

Fixing £ > £x, we can look for optimal v. There exists a unique 9^ such that 
4>(9^,T) = £. Then it := <p(9^ , •) is the optimal path (tr realizes the infimum in 
(3.45). Let us note that at the end point £ = £r, we have 

^ = -(1 - T) , vHt) = J(T) - J(T - t) , (««)'(*) = log(T - t) .□ 

Remark 6.6. It is possible to get (3.47), (3.48) using Gartner-Ellis' theorem (Dembo 
and Zeitouni (1998), Theorem 2.3.6). We are in the same situation as in Remark 
6.4. 

This approach allows to get one-sided large deviations in the critical case T = 1 . 
Actually we get 

lim-^logP(logA^(l) > nx) = -7f» 

for x > —1. The value x = — 1 corresponds to the limit. Note that the second 
(right) derivative of if at this point is zero (or cquivalently lim(£f')"(0) = oo as 
9 l 0), which is consistent with previous results on variance. We do not know the 
rate of convergence to of P(log A^ ,/3 (l) < nx) for x < —1. 

6.4. Proof of Theorem 3.13 and Theorem 3.14- We may try again to use the beta- 
gamma algebra, but we do not succeed to go until the end. Let as in Subsection 

5.5, U n and V n be the two Laguerre variables. From the exponential tightness of U n 
and V n , we deduce easily the exponential tightness of W n . From Puhalskii (2001), 
the sequence W n contains subsequences satisfying LDP. If for such a subsequence 
we call P the rate function, the independence gives 

I u = I v DI p 

This equation has many solutions and only one convex solution, which is 

p = jU B jV 

defined by 

(/ B g){x) = sup{/(xi) - g(x 2 ) | x\ - x 2 = x} 
( Mazure and Voile (1991)). But we do not know a priori that I p is convex. 
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We choose to use the beta-gamma trick to study the n.c.g.f. For the remaining 
we do not give details since it is similar to the above cases and again based on the 
ideas of Baldi theorem (Dembo and Zeitouni (1998)) and a variational formula. 

6.4.1. Convergence of the n.c.g.f. Put 6^ = n^ 1 log so that 

fc=i 

and put for T < t\ and ip e Vj : 

£-n,\nT\ if) = ^logEexpn^,^). 
Lemma 6.7. If <p e V^ 1 satisfies ip(s) + n — s > /or every s e (0, T], ffcera 

lim< L "Tj M = Af 0;T] (^) := ^ ¥>(«)) , (6.22) 

where, for 9 + t\ — s > 

/(s,0) =£{n- 8 + 6,^6) . (6.23) 
Proof: From (2.28) we have 

(ne£ 7) + ((ni + n 2 )e^ 1+ „ 2 , 7 ((m + n 2 ) • /n)) - 

L " TJ 

(ni6^,7(ni •/«))+ log ^ ^( fc / n ) 

m + n 2 ^ 

and then, by independence, 

logEexp(/3'n 2 e^,7) = log Eexp(/3'nm 6^,7(711 • /n)) 

-logEcxp(/3'n(ni +n 2 )e^ 1+ „ 2 ,7((ni +n 2 ) • /n)) 

i nT \ 

+nlog ^T^(^ 7(fc/n) 

fe=i 



-^logEcxp(/3'npe^, 7 (p./n))^i f g L ( 1,7^1) d s , (6.24) 

> V T Jq \T T / 



By a slight modification of (6.20) we have, for p/n —* r 

/■T 

so that taking r = n and r = n + t 2 and subtracting, we get 
^LlogEexp^Ve^,^) /(s,7(s)) , 

where 

s 9 



g J(s,6) = ng L (-,-) -( Tl +T 2 )g L (- 



-n ' ti/ \ri + t 2 ' (n + t 2 )- 

Tl 



+0 log ■ 



Tl + T 2 

and this is equivalent to (6.23). □ 
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6.4.2. Duality. Let us define Aj^ T ^p) = +00 if ip does not satisfy the assumption 
of Lemma 6.7. The dual of A/ Q T , is then 

( A fo,T]) (") = SU P {< V > - A fo,T](^)} (6-25) 
for veV e *. Mimicking the method of Leonard (2000) p. 112-113, we get 

( A fo,T])V) = sup{< -Af T] (^)} (6.26) 

where C is the set of continuous functions from [0, T] into R vanishing at 0. Then 
we apply Theorem 5 of Rockafellar (1971). We get 

(At„)* M - [ {</)'{<, + £•■'{'■ £<«)) *w < 6 - 27 > 



where 



(.9 J )*(s,y)-sup{Aj/-/( S ,A)(5(A|(-r 1 ,cx3))} . (6.28) 

This supremum is achieved by 

X J {s,y) = -{T 1 -s) + ^— [ (6.29) 

and we have 

(? J )*{s,y) = X J ( S ,y)y-g J ( S ,X J ( S ,y)) (6.30) 

= (ti+t 2 -s)h( Tl ~ S e y ). (6.31) 
V Tl + T2 — s / 

The recession is r J (s; y) = — (n — s)y if y < 0. 

6.4.3. Proof of Theorem 3.10. We use the contraction from the LDP for paths. 
Since the mapping m m([0, T]) is continuous from D to R, the family {6;{(T)}„ 
satisfies the LDP with good rate function given by (3.50). Since the process 9 n 
takes its values in (— oo,0] , it is clear that 1^(0 = oo for £ > 0. Fixing £ < 0, 
we can look for optimal v, i.e. a path (v(t),t e [0, T}) such that w(T) = £ and w 
achieves the infimum in (3.50). Fix 8 > T — t\ (playing the role of a Lagrange 
multiplier). In view of (6.27), (6.28) and (6.29), it is clear that (in the generic case) 
the Euler-Lagrange equation is 

x J ( s ^'( s )) = e 
0(0) = o. 

This ordinary differential equation admits for unique solution in C 1 ([0, T}) 

:= £(0 + ti,75,*). 
To know if the path 4> J may have £ as its terminal value, look at 

£'(0 + 7l,T 2 ,T) = JL S ( T} t 2 ,T)\ t= 8 +Ti 

= log(l- T ) - log (l - -j— ) ; 
since it is positive, we see that the mapping 

^ £(0 + 71, 72, T) 
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is continuous and increasing from [T — ri, oo) onto T>t = 0). If £ € 0), we 
call 0£ the unique solution of <p J (9, T) = £ or in other words, 

£(0£+ Ti ,t 2 ,T) = £, 

and we set := </> J (% , •)• 

To end the proof, let us now consider some inequalities. The duality property 
(6.28) gives, for every v and t 

(■«')*('• <« 2 > 

Setting 

: = /* g J (t,6) dt , (6.33) 
Jo 

integrating (6.32) and using (3.35), (3.49) and (6.33) we get 

I[o,T](v) > 9ia([0,T\) - L J T {6) - [ (n - t) dv s (t) . 

Jo 

For every v such that v(T) = £ it turns out that 

ll T] {v)>6^-L J T {6)- [ (n -T + 0) d* a (t) > 6Z-L J T {6). (6.34) 
Jo 

There are three cases. 
• If f e [#,0) , we get 

so that realizes the infimum in (3.50). A simple computation leads to (3.42) 

which ends the proof of the first statement of Theorem 3.10. 

cJ 

T-t 



Let us note that at the end point £ = we have 



9 i = {T- n) , v*(t) - £{T, r 2 , t) , = log ■ 



Tl + T 2 - i 

• Let us assume i — ^ — s with e > 0. Plugging = T — T\ in (6.34) yields, for 
every v such that u(T) = £ 

^, T ] («) > (T - n)# - L^(T - n) - e(T - n) = - e(T - n) , 

and this lower bound is achieved by the measure v = (t)dt — eddrit), since 

'4(t, (t))dt = Ifai) , / (n - t)ed5 r (t) = (n - T)e . 



T 

Li 



• It remains to look at £ = 0. Taking £ = in (6.34), we get /fo )T] (0) > ~L^{9) 
for every > T — t\. Now, from (6.23) and (6.33), we may write (after some 
calculation) 

-L J T (9)= [ -E{T 1 -t + 6,T 2 ,6)dt =[[ logfl + ^ )ds 



JO 



n - t + s / 



which tends to infinity as 9 — * oo. We conclude T j(0) = oo. □ 



> J I io S (:i + ^^) cZ«. 
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7. Proofs of Theorems of Section 4 

7.1. Proof of Proposition J^.l. Let 9 e R be a Lagrangian factor. We begin by 
minimizing 



It pL (v) -0T J log a; dfi(x) = T 2 -E(/i) + 2y g A , a (a;) d/i(x) 



+ 2B(T) 



where 



2 ^ T -\- 9 

q\, s (x) = Ax- slogx, A=— , s = — . (7.1) 



In the book of Saff and Totik (1997) p. 43 example 5.4, it is stated that for A > 
and 2s + 1 > fixed, the infimum 



inf -S( / u) + 2 J qx. s {x) dfj,(x) 



is achieved by the unique extremal measure with 

2s + 1 1 



a 2 = — — — , c = 



2A ' 2s + 1 
We see from (7.1) that if 9 > — 1 we can take: 

T T 



a 2 = l 



Now it remains to look for 9 such that the constraint fj, e A(£/T) is saturated. 
Since 

jlo gxd<2 ( X )=lo g a 2 + jlo gxd ^ X )d X , 
and thanks to (4.6) we see that 9 must satisfy 



2 rrJ^-c) 



i = T\o g a z - T—^— — '- = J(l + 9) - J(l - T + 9) , 

which is exactly exactly (3.46). 

To compute i^ pL (7r£ 2 ), we start from the definition (4.8): 

I s T pL {*U) = -T 2 £« 2 ) + T J (x — (1 — T) logx) d< 2 (x) + 2B(T) , 

and transform ir^ 2 to using the dilatation. In particular, (4.9) yields 

£« 2 )=lo g( 7 2 + £K) 
and S(7rJ) may be picked from formula (13) p. 10 in Hiai and Petz (1998) : 

SK) = -l + i(c- 1 +logc +(c- 1 -l) 2 log(l-c)) . 

Besides we have easily J x dnf(x) = 1. After some tedious but elementary compu- 
tations we get exactly the RHS of (3.47), which yields 

and ends the proof of (4.12). □ 
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7.2. Proof of proposition 4-6- Let < t — n (Lagrangian multiplier). We begin by 
minimizing 



I s t pJ {n) - Ot I logx dfi(x) 



(7.2) 



-E(/x)-2Ci y loga;d/i(a;)-2C2 ^ Iog(l - a;) d/i(x) 



C 



where 



2Ci = 



Tl+ ^ t , 2 C2 = ^ andC = i 2 i?( 



Tl —t T2 — t 



t ' " t 

We use the following lemma. 

Lemma 7.1. For Q\,C,2 > 0, i/ie infimum of 

-E(/i)-2Ci y logxd M (x)-2C 2 ^ log(l - a;) d/i(ar) 

among the probability measures /i on [0, 1] is achieved by 7r _ j0+ where 

(a_,a+) = A ± (s_,s + ) 

l + 2Ci 



)■ 



_ 1 + 2Ci + 2C 2 

2(i + Ci + C 2 ) ' s+ 2(1 + Ci + C 2 ) 
The infimum in (7.2) is achieved by 7T£ -, where 



= A±(s_,s+) , s_ 



Tl + _ Tl + + T 2 - t 



Tl + T 2 + 



Tl + T 2 + 



It should be clear that 



n+6-t t 2 



and then, on A(^T~ X ) the infimum is uniquely realized in - and its value is 

'T\-t T2 — t\ _ / Tl + - t t 2 - t > 



t 7 t J V i 
Finally a small computation leads to (3.52) and (4.31). 

Proo/ of Lemma 7.1 In Saff and Totik (1997) p. 241, it is proved that the infimum 
of 

J J ~ log \x - y\d/j,(x)dn(y) - 2d J log(l - x) dfj,(x) - 2( 2 J log(l + x) d/i(x) 
among the probability measures /x on [—1, +1] is achieved by 

d/i(tf) = K(b. , 6+ ) ^H^pZM i [6 _ >b+] (y) dy , 
where b± = d\ - 6\ ± \/A with 

^ = 7— r 1 — - , i = 1,2 , A = [1 - (0! + 2 ) 2 ] [1 - (0i - 2 ) 2 ] . 

1 + SI + S2 

and K(b-,b + ) is a normalizing constant. With the push-forward by the function 
x — > (x + l)/2, we get the result. □ 
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8. Appendix 1 : Some properties of I = logT and 

From the Binet formula ( Abramowitz and Stegun (1972) pp. 258-259 or Erdelyi 
et al. (1981) p.21), we have 



(a;) = (x- l)\ogx-x + l+ f f( S )[e- sx - e - s }ds 
* Jo 

= (x-±)\ogx-x + Uog(2n) + f(s)e- sx ds. 



where the function / is defined by 



f(s) = 



1 1 1 

- - + 



2 s e s - 1 
and satisfies for every s > 



1 00 1 
L = 2 V - 

.e .<j2 i A n 



S 2 + A7T 2 k 2 
k=l 



< /(*) < /(0) = 1/12 , < ( sf(s) + 1 ) < 1 . 



By differentiation 
lo. 



gx - - ^ + jf °° S .f(s)e- sx ds = £ e~ sx (sf(s) + ds . 



As easy consequences, we have, for every x > 

< x (logx - $(x)) < 1 , 

< logs - ^(x) - ^- < ' 



2x - 12a; 2 ' 
Differentiating again we see that for q > 1 

* (?) (z) = (-l) q - 1 q\z- q + (-l) q - 1 e- sz s q (sf(s) + ^ ds 

and then 

|*^(z) - (-1)9-Vz-«| < z" 9 " 1 ^ . 
9. Appendix 2 : Identification of the McKay distribution 



8.1) 
8.2) 

8.3) 

8.4) 

8.5) 

8.6) 
8.7) 

8.8) 
.9) 



The reader is recalled that, for u' and v' positive numbers 4 such that u' + v' > 1, 
Capitaine and Casalis (2004) defined the probability measure 

CC U> / := (1 - u')+5 + (1 - v')+Si + [1 - (1 - u')+ - (1 - v')+] ir a _, a+ , 

where 



(a_, a + ) = a± ( , 1 \ . 

Vti' + t)' u + v ) 



We present now three identifications of this distribution connected with free prob- 
ability. 

For k 0, let the dilatation operator by factor k. For p < 1, let b p denote the 
Bernoulli distribution of parameter p. At last, let ffl (resp. Kl) denote the additive 
(resp. multiplicative) free convolution. 

1) Rewriting the distribution with the notation of Demni (2006), we find four cases 



4 we use the symbol v' (hence u') not to confuse with f3 already defined. 
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• Situation I : mm(u',v') > 1, no Dirac mass 



u' 1 , cr_ , 1 — <7_ 



M' + IT It' + IT 1 — (T + 1 — (T + 



CC,-,„- = 7r a _ !a+ = £>i_ CT+ (b CT _) ^ 

• Situation 77 : u' < 1 < v' , one Dirac mass at 

1 m' , 1 - a+ , o-+ 

= — 7 , cr+ = 1 — 7 , U = , V = 

u' +v' W + v' er_ er_ 

CC u >y = (1 - u')S + u'ir a _^ a+ 
= ^_(b 1 _ CT+ ) ffl - 

• Situation 777 : v' < 1 < u', one Dirac mass at 1 

1 u' , g+ / 1 - g+ 

?/ + v' W + v' 1 — er_ 1 — er_ 

CC u iy = {1- v')5i +v'lT a _ ia+ 



= Dr_ CT _(b a+ ) 1 -- 
• Situation TV : max(u',i/) < 1, two Dirac masses (at and at 1) 
vl 1 . 1 — <t_ , cr_ 

<T_ = 1 - , cr+ = , U = , U = 

?i' + 1/ u' + l>' cr + er + 

CC u -y = (l-tt% + (l-«')<Sl + («'+^'-l)fa_,« + 



2) There is a connection with the family of free Meixner law (Bozejko and Bryc 
(2005), Bryc and Ismail (2006), Bryc and Ismail (2005)). Indeed, computing the 
mean to and variance V of the distribution CC u iy, we get 



Situation 


m 


V 


I 


(7_ 


CT_(l-<7_)(l-<7+) 


II 




cr_(7 + (l - C7+) 


III 


°+ 


(l-a_)<7 + (l-<7 + ) 


IV 


1-C7_ 


(7_<7 + (l — (7_) 



so that, in all cases 

u' u'v' 
~ u' + v' ' ~~ (u' + w') 3 ' 

We see that fixing u' + v' = s _1 , wc get V = s 2 to(1 — to), and then up to an 
affine transformation we find the "free binomial type law" as in Bryc and Ismail 
(2005) example vi p. 18 or Bozejko and Bryc (2005) example 6 p. 8. It could also be 
seen starting from the above formulae using dilatations and free convolutions and 
comparing with formula (7) page 6 in Bozejko and Bryc (2005). 
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3) Finally, we quote the correspondence with the results of Collins (2005) who 
claimed that for < p- < p+ < 1 

b p _ H b p+ = (1 - p-)6 + (p_ +p + - 1)+5 1 + C-^ a+ Tr a _, a+ 

where a± = a±(l — p-,p + ). In Hiai and Pctz (2006), formula (2.8) the authors 
consider the same distribution. 

• Situation II : p_ + p + — 1 < 0, cr_ = p + , a + = 1 — p_ 

b p _ M b p+ = <7 + 5 + (1 - er + )7r Q _ iQ+ 
= o-CC ul . v , + (1 -<t-)S 

• Situation TV : p- + p+ — 1 > 0, ct_ = 1 — p- , <j+ = p+ 

bp_Klb p+ = o-_(5o + (<t+ - o-)5i + (1 - cr+)7r Q _, a+ 
= a+CC u ,y + (1 - <t+)6 ■ □ 
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